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(57) ABSTRACT 

A novel polymorphism in the human PC-1 gene is 
characterized, which is associated with an increased predis- 
position to developing insulin resistance. The polymorphism 
affects heterozygous and homozygous carriers of the allele. 
The subject nucleic acids and fragments thereof, encoded 
polypeptides, and antibodies specific for the polymorphic 
amino acid sequence are useful in determining a genetic 
predisposition to insulin resistance. The encoded protein is 
useful in drug screening for compositions that affect the 
activity of PC-1 and insulin receptor activity or expression. 
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POLYMORPHIC HUMAN PC-1 SEQUENCES 
ASSOCIATED WITH INSULIN RESISTANCE 

CROSS-REFERENCE 

This application claims priority to provisional patent 5 
application No. 60/108,853, filed Nov. 18, 1998. 

Insulin resistance occurs in 25% of non-diabetic, non- 
obese, apparently healthy individuals, and predisposes them 
to both diabetes and coronary artery disease. Diabetes mel- 
lirus is a major health problem in the United States affecting 10 
approximately 7% of the population. The most common 
form of diabetes mellitus is non-insulin-dependent diabetes 
mellitus (NIDDM or type II diabetes). Hyperglycemia in 
type II diabetes is the result of both resistance to insulin in 
muscle and other key insulin target tissues, and decreased 15 
beta cell insulin secretion. Longitudinal studies of individu- 
als with a strong family history of diabetes indicate that the 
insulin resistance precedes the secretory abnormalities. Prior 
to developing diabetes these individuals compensate for 
their insulin resistance by secreting extra insulin. Diabetes 20 
results when the compensatory hyperinsulinemia fails. The 
secretory deficiency of pancreatic beta cells then plays a 
major role in the severity of the diabetes. 

Reaven (1988) Diabetes 37:1595-607 were the first to 2J 
have investigated insulin resistant, non-diabetic, healthy 
individuals from the general population who are non-obese. 
Strikingly, they observed that 25% of them have insulin 
resistance that is of a similar magnitude to that seen in type 
II diabetes patients. These individuals compensate by having 3Q 
insulin levels that are 3-4 times higher than normal. These 
elevated insulin levels are sufficient to maintain normogly- 
cemia. Others have also confirmed that a large proportion of 
the non-diabetic population is insulin resistant. These insulin 
resistant, non -diabetic individuals have a much higher risk 35 
for developing type II diabetes than insulin sensitive sub- 
jects. 

However, even without developing hyperglycemia and 
diabetes, these insulin resistant individuals pay a significant 
price in terms of general health. Insulin resistance results in ^ 
an increased risk for having elevated plasma triglycerides 
(TG), lower high density lipoproteins (HDL), and high 
blood pressure, a cluster of abnormalities that have been 
termed by different investigators as either Syndrome X, the 
insulin resistance syndrome, or the metabolic syndrome. It is 45 
believed that either the hyperinsulinemia, insulin resistance, 
or both play a direct role in causing these abnormalities. 
Data from ethnic, family, and longitudinal studies suggest 
that a major component of resistance is inherited. 

The cellular response to insulin is mediated through the 50 
insulin receptor (IR), which is a tetrameric protein consist- 
ing of two identical extracellular alpha subunits that bind the 
hormone and two identical transmembrane beta subunits that 
have intracellular tyrosine kinase activity. When insulin 
binds to the IR alpha subunit, the beta subunit tyrosine 55 
kinase domain is activated, and insulin action ensues. When 
insulin activates the receptor, the beta subunit is autophos- 
phorylated at the juxtamembrane domain, the tyrosine 
kinase domain and the C-terminal domain. Subsequently, 
endogenous substrates including IRS-1, IRS-2 and SHC are go 
tyrosine phosphorylated. These phosphorylated substrates 
act as docking molecules to activate SH2 domain molecules 
including: GRB-2 which activates the ras pathway; the p85 
subunit of PI-3-kinase; protein tyrosine phosphatase PTP2/ 
SYP; PLCy/NCK; AKT and others. 65 

PC-1 is a class II transmembrane glycoprotein that is 
located both on plasma membranes and in the endoplasmic 
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reticulum. PC-1 is the same protein as liver nucleotide 
pyrophosphatase/alkaline phosphodiesterase I. In addition to 
muscle tissue, PC-1 has been reported to be expressed in 
plasma and intracellular membranes of plasma cells, 
placenta, the distal convoluted tubule of the kidney, ducts of 
the salivary gland, epididymis, proximal part of the vas 
deferens, chondrocytes and dermal fibroblasts. PC-1 exists 
as a disulfide linked homodimer of 230-260 kDa; the 
reduced form of the protein has a molecular size of 115-135 
kDa, depending on the cell type. Human PC-1 is predicted 
to have 873 amino acids. 

PC-1 is inserted into the plasma membrane such that there 
is a small cytoplasmic amino terminus, and a larger extra- 
cellular carboxyl terminus. The extracellular domain of 
PC-1 has a high cysteine region that is involved in dimer 
formation, an ATP binding site and enzymatic activity which 
cleaves sugar-phosphate, phosphosulfate, pyrophosphate, 
and phosphodiesterase linkages. The active enzyme site for 
phosphodiesterase and pyrophosphatase contains a key 
threonine residue, however a mutation of this residue does 
not impair the ability of PC-1 to inhibit IR function. 

Belli et al. (1993) Eur J Biochem 217(l):421-8 discloses 
the existence of enzymatically active water-soluble forms of 
PC-1. Biosynthetic studies revealed a single, monomelic, 
endoglycosidase-H-sensitive membrane PC-1 precursor, 
which was gradually converted to a disulphide-bonded, 
endoglycosidase-H-resistant form. The soluble form of PC-1 
does not appear to arise by proteolytic cleavage from the cell 
surface, although cleavage inside the cell remains a possi- 
bility. The data suggest that the most likely site of cleavage 
is between Pro 152 and Ala 153. 

PC-1 levels are increased in fibroblasts from most patients 
with typical NIDDM and insulin resistance. In addition, 
ovcrcxpression of PC-1 in transfectcd cultured cells reduces 
insulin-stimulated tyrosine kinase activity (Goldfine et al. 
(1998) Mol Cell Biochem 182:177-184). PC-1 content in 
fibroblasts negatively correlates with both decreased in vivo 
insulin sensitivity and decreased in vitro IR autophospho- 
rylation (Frittitta et al. (1998) Diabetes 47:1095-1100). 

In cells from insulin-resistant subjects, insulin stimulation 
of glycogen synthetase was decreased. PC-1 content is also 
elevated in fibroblats, muscle and fat of non-diabetic insulin 
resistant subjects. The elevation of PC-1 content may be a 
primary factor in the cause of insulin resistance, although the 
mechanism by which PC-1 inhibits insulin receptor activity 
is unknown. 

Many mechanisms may potentially contribute to insulin 
resistance. One major mechanism is the impairment of 
insulin receptor tyrosine kinase (IR-TK) activity, a key step 
in insulin receptor signalling. Several inhibitors of IR-TK 
have been associated to insulin resistance. Among them is 
PC-1, a class II transmembrane glycoprotein that is overex- 
pressed in tissues of insulin resistant subjects. The human 
PC-1 gene has been assigned to the same chromosomal 
region (6q22-q23) where both STS D6S290 (which has been 
linked to type 2 diabetes in Mexican-Americans), and the 
gene responsible for transient neonatal diabetes map. The 
identification and characterization of genetic sequences 
involved in insulin resistance is of great medical interest. 

Database References for Genetic Sequences 
The human cDNA and encoded amino acid sequence for 
PC-1 may be accessed in Genbank, M57736 J05654. As a 
reference, the "K" allele is provided herein as SEQ ID NO:l, 
and the encoded polypeptide as SEQ ID NO:2. The "Q" 
allele is provided as SEQ ID NO:3, and the encoded 
polypeptide as SEQ ID NO:4. 
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SUMMARY OF THE INVENTION PC-1 is a class II membrane glycoprotein that inhibits 

„ , . . . . , . ... activation of insulin receptor tyrosine kinase, and is associ- 

Human PC-1 nucleic acids and polypeptides are provided, aled ^ resistance> A novel polynlorphism m cxon 

including promoter and "^on-cxon boundaries. Polymor- 4 f ^ pcM fa sigDificamly collated with insulin 

phic sequences are provided tot encode a form of the The subject genes and fragments thereof, 

protein associated with increased insulin resKtance, where a encodcd { J mc s for ^ ^ 

naturaUy occurring polymorphism of interest comprises a resistance forms of * useful in character- 

lys-glu substitution at position .121 of the protein, m the ^ ^ for ^ t0 ^ resistance Such 

ftSC^^ffi screenmg methods may be used in conjunction with coun- 
and fragments thereof, encoded polypeptides, and antibodies 

specific for the polymorphic amino acid sequence are useful Nucleic Acid Compositions 
in determining a genetic predisposition to insulin resistance. 

The encoded protein is useful in drug screening for compo- As used herein, the term PC-1 genes and encoded 

sitions that affect the activity of PC-1 and insulin receptor polypeptides shall be used to generally designate any of the 

activity or expression. Screening methods that analyze mammalian PC-1 genes and gene products, and unless 

plasma levels of soluble PC-1 are also provided, where otherwise stated will be the human homolog. The sequences 

convenient quantitation of PC-1 content is used in diagnosis °f ^ invention comprise a sequence polymorphism, gen- 

of insulin resistance. erally resulting in a change in coding sequence, that confer 

a susceptibility to insulin resistance, and may lead to hyper- 
BRIEF DESCRIPTION OF THE DRAWINGS 20 glycemia and NIDDM. Such polymorphisms may be generi- 

tti^c lA-imc i • c ™> < „ callv re f erred to herein as a resistance associated PC-1 

FIGS. 1A and IB Sequence analysis of PC-1 exon 4. sequence, or PC-1* Counseling and preventive measures 

Arrows point to nucleotide N. The Q allele sequence is are particularly important for such patients, and early diag- 

depicted in the upper panel, the K allele in the lower The ^ ides infonnation concernirig such a predisposi- 

Avall restriction enzyme recognition site is underlined in the 25 tion ° r 

Q allele sequence. Avail digestion of PC-1 exon 4 amplimers ' „ ^ , ... , 

from 7 different genomic DNAs. The 238 bp amplimer is „ ™ e effect . of a candidate sequence polymorphism on 

completely digested in the QQ sample, resulting two smaller PC '} ex P ressi0D or function may be determmed by kindred 

148 and 90 bp fragments. While KK samples remain analysis for segregation of the sequence vanation with the 
undigested, KQ samples reveal a partial (50%) digestion. 30 dls «f Polype, e.g. insulin resistance, hyperglycemia, 

circ VAA U1 . * ' . ,. etc. A predisposmg mutation will segregate with incidence 

FIGS.2<A) and 12(B> Plasma glucose (A) and muling) of me disease ^ subjecl mu utions generally have a 

levels during an OGTT (75 g) in Q allele carriers (n-33, domirjam phenotvp e, where a single altered allele will 

white circles) and KK subjects (n-68, black circles). confer disease SUSC eptibility. The penetrance will vary with 

§-p<0.05 and *=p<0.01 vs. KK subjects. me specific 

FIG. 3. Insulin receptor autophosphorylation in fibro- ^ an a i teraa tive to kindred studies, biochemical studies 

blasts from Q allele earners (n-5 white circles) and KK are performed to determine whether a candidate sequence 

subjects (n-5. black circles). This function was determined variation m lhe ^ ion or re ^ 0QS affects 

by exposing cells for 10 minutes to increasing insulin the quantity or wnclion of lhe prote ia The effect of a 

concentrations (0-100 nM). Cells were then solubihzed and sequence variation on the interaction between PC-1 and 

the insulin receptor immunocaptured on plastic wells pre- m sulin re ceptor or other tyrosine kinases is determined by 

coated with a monoclonal antibody specific to the insulin bindirig studies or kinase assays, where a decreased level of 

receptor. After washing a biolmylated anliphospholyrosine inhibition or binding is indicative of a predisposing muta- 

antibody was added followed by peroxidase-conjugated tion . Norma i PC -i will inhibit the tyrosine kinase activity of 

streptavidin detection assay. Data are expressed as arbitrary the insu lin receptor, but not other tyrosine kinases, 

units, normalized for protein content. **-p<0.01 vs. KK - nna a ... , f tU . .. . .. . 

subiects embodiment of the invention, polymorphisms of 

interest provide for amino acid substitutions in the extracel- 

DESCRIPTION OF THE SPECIFIC * u * ar domain °f PC-1, particularly the cysteine-rich domain, 

EMBODIMENTS which may substitute a charged amino acid with a neutral 
so amino acid. In one embodiment of the invention the amino 

Methods and compositions are provided for diagnosing a acid substitution is at a lysine residue in this region, e.g. 
predisposition to human insulin resistance. The methods K121 or K119. Polymorphisms at these residues, where the 
comprise an analysis of germline DNA for a predisposing lysine is substituted with any of the other 19 naturally 
polymorphism in the gene encoding PC-1, where presence occurring amino acids, may be referred to generically as a 
of the altered gene confers an increased susceptibility to 55 [*121] PC-1 or [*1 19] PC-1 polymorphisms. Specific poly- 
insulin resistance. Human PC-1 gene and gene product morphisms of interest substitute a neutral amino acid in 
compositions arc provided that encode specific polymorphic place of the lysine, particularly glutamine or arginine. A 
forms of PC-1. Polymorphisms of interest include a coding naturally occurring polymorphism associated with insulin 
change at position 121 of the protein, and polymorphisms of resistance comprises a lys-»glu substitution at position 121 
the 3' UTR. ^ of the protein, herein referred to as "[K121Q] PC-1", or 

In another embodiment of the invention, the concentration merely "[Q] PC-1". 

of soluble PC-1 protein in patient plasma is used as a The human [Q] PC-1 amino acid sequence is provided as 

diagnostic. PC-1 circulates in human plasma and low plasma SEQ ID NO:4, and the encoding gene as SEQ ID NO:3. In 

PC-1 level is independently associated with several features order to identify the subject PC-1 polymorphisms, exonic 
of the insulin resistant "metabolic syndrome" including 65 primers from the published sequence data were used to 

abdominal fat distribution, high blood pressure and, with isolate genomic clones. Sequence data from the genomic 

respect to lipid metabolism, insulin resistance. clones was used to generate specific primers. These primers 
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were used to amplify genomic DNA. The PCR products known in the art, e.g. sequence similarity to known binding 

were screened for mutations using single strand conforma- motifs, gel retardation studies, etc. For examples, see Black- 

tion polymorphism (SSCP) analysis. The specific polymor- well et al. (1995) Mol Med 1: 194-205; Mortlock et al. 

phism found in SEQ ID NO:3 was identified in a number of (1996) Genome Res. 6: 327-33; and Joulin and Richard-Foy 

patients. 5 (1995) Eur J Bioclxem 232: 620-626. Specific regulatory 

DNA encoding a PC-1* protein may be cDNA or genomic motifs are found m toe provided promoter sequence at 

DNA or a fragment thereof that encompasses the altered positions: SEQ ID NO:5; 192-205; and SEQ ID NO:5, 

residue, e.g. [*121] PC-1. As known in the art, cDNA 453-458. 

sequences have the arrangement of exons found in processed The nucleic acid compositions of the subject invention 

mRNA, forming a continuous open reading frame, while 10 encode all or a part of the subject polypeptides. Fragments 

genomic sequences may have introns interrupting the open may be obtained of the DNA sequence by chemically 

reading frame. The term "[*121] PC-1 gene" shall be synthesizing oligonucleotides in accordance with conven- 

intended to mean the open reading frame encoding such tional methods, by restriction enzyme digestion, by PCR 

specific PC-1 polypeptides, as well as adjacent 5' and 3' amplification, etc. For the most part, DNA fragments will be 

non-coding nucleotide sequences involved in the regulation 15 at least about 25 nt in length, usually at least about 30 nt, 

of expression, up to about 1 kb beyond the coding region, in more usually at least about 50 nt. For use in amplification 

either direction. The intron-exon boundaries of the PC-1 reactions, such as PCR, a pair of primers will be used. The 

gene are provided in the examples. exact composition of the primer sequences is not critical to 

Genomic sequences of interest comprise the nucleic acids tne invention, but for most applications the primers will 

present between the initiation codon and the stop codon, 20 hybridize to the subject sequence under stringent conditions, 

including all of the introns that are normally present in a as known in the art. It is preferable to chose a pair of primers 

native chromosome. It may include the 3' and 5' untranslated that wil1 generate an amplification product of at least about 

regions found in the mature mRNA. It may further include 50 nt > preferably at least about 100 nt. Algorithms for the 

specific transcriptional and translational regulatory selection of primer sequences are generally known, and are 

sequences, such as promoters, enhancers, etc., including 25 available in commercial software packages. Amplification 

about 1 kb of flanking geoomic DNA at either the 5' or 3' end primers hybridize to complementary strands of DNA, and 

of the coding region. The genomic DNA may be isolated as ™n prime towards each other. Amplification primers of 

a fragment of 50 kbp or smaller; and substantially free of interest include the inlron sequences flanking each exon, as 

flanking chromosomal sequence. shown in the examples, which may he immediately outside 

The genomic PC-1 5' and 5' sequence, including specific 30 °f lhe «*■ sequence, or may span the actual junction. Use 

transcriptional and translational regulatory sequences, such of sucb P" mers allows S P??* C amplification of the exon 

as promoters, enhancers, etc., including about 1 kb, but sequence from genomic DNA. 

possibly more, of flanking genomic DNA at the 5' end of the The subject PC-1* genes and associated sequences are 

transcribed region, is of particular interest. The promoter 5 isolated and obtained in substantial purity, generally as other 

region is useful for determining the pattern of PC-1 than an intact mammalian chromosome. Usually, the DNA 

expression, e.g. induction and inhibition of expression in will De obtained substantially free of other nucleic acid 

various tissues, and for providing promoters that mimic sequences that do not include a PC-1 sequence or fragment 

these native patterns of expression. A polymorphic PC-1 thereof, generally being at least about 50%, usually at least 

regulatory sequence, i.e. including one or more of the about 90% pure and are typically "recombinant", i.e. flanked 

provided 3' UTR polymorphisms, is useful for expression bv one or more nucleotides with which it is not normally 

studies to determine the effect of sequence variations on associated on a naturally occurring chromosome. 
mRNA expression and stability. The polymorphisms are also 

used as single nucleotide polymorphisms to detect genetic PC-1 Polypeptides 

linkage to phenolypic variation in activity and expression of 45 The subject nucleic acid compositions may be employed 

PC-1. The polymorphic 3' UTR sequences are provided as for producing PC-1* protein, or fragments thereof that 

SEQ ID NO:6 ("A" allele); SEQ ID NO:7 (" P" allele); and encompass a polymorphisms of interest, e.g. [Q121] PC-1. 

SEQ ID NO:8 ("N" allele). The polymorphisms are as For expression, an expression cassette may be employed, 

follows: providing for a transcriptional and translational initiation 

50 region, which may be inducible or constitutive, the coding 
region under the transcriptional control of the transcriptional 
initiation region, and a transcriptional and translational 
termination region. Various transcriptional initiation regions 
may be employed which are functional in the expression 
host. 

The peptide may be expressed in prokaryotes or eukary- 
otes in accordance with conventional ways, depending upon 
The promoter region of PC-1 is provided as SEQ ID the purpose for expression. For large scale production of the 
NO:5. The promoter region is useful for determining natural protein, a unicellular organism or cells of a higher organism, 
patterns of expression, particularly those that may be asso- 60 e -g- eukaryotes such as vertebrates, particularly mammals, 
ciated with disease. Alternatively, mutations may be intro- may be used as the expression host, such as E. coli, B, 
duced into the promoter region to determine the effect of subtilis, S. cerevisiae, and the like. In many situations, it may 
altering expression in experimentally defined systems. The be desirable to express the subject PC-1 gene in a mamma- 
promoter also finds use in the construction of animal models Han host, whereby the PC-1 gene product will be 
where it is desirable to mimic the native patterns of PC-1 65 glycosylated, and secreted. 

expression. Methods for the identification of specific DNA With the availability of the protein in large amounts by 

motifs involved in the binding of transcriptional factors are employing an expression host, the protein may be isolated 
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and purified in accordance with conventional ways. Alysate range of insulin sensitivity in normal individuals, some of 

may be prepared of the expression host and the lysate whose values overlap with similar values in people with 

purified using HPLC, exclusion chromatography, gel diabetes. Therefore, one cannot distinguish between nondia- 

electrophoresis, affinity chromatography, or other purifica- betic and diabetic individuals on the basis of measures of 

tion technique. The purified protein will generally be at least 5 insulin resistance. 

about 80% pure, preferably at least about 90% pure, and may The most widely accepted research method or 'gold 

be up to and including 100% pure. By pure is intended free standard' is the euglycemic insulin clamp technique. With 

of other proteins, as well as of cellular debris. tnis procedure, exogenous insulin is infused, so as to main- 

The polypeptide is used for the production of antibodies, tain a COQSt r ant P^ma insulm level above fasting, while 

where short fragments provide for antibodies specific for the io S lucose 1S fixe f al abasal lev r eI Y*™** g' ucose at 

particular polypeptide, and larger fragments allow for the ^"f. rales ' ™ IS S lucose inf , us,on 18 de ,vered ™ an 

production of antibodies over the surface of the protein. indwelling catheter at a rate ^ based on plasma glucose 

Antibodies may be raised to the normal or insulin resistant .^remen^ every 5 mm. When the plasma glucose level 

forms of KM. Of particular interest are antibodies that faUs ^ basa *' ^ glucose mfusion rale is increased* 

specifically recognize the insulin resistant forms of the 15 returQ P 1 ™* t0 basal levels and yice versa. The 

protein, i.e. the antibodies do not bind to the normal form. a ™™t of g^ose infused over time (M value) is an index 

Also of interest are antibodies that recognize the soluble ° f l0 * uhn a * l ? n ° n & ucosc metabolism. The more glucose 

forms of the protein. Antibodies may be raised to isolated lbat has t0 be mfused f er ™* time then the more sensitive 

peptides corresponding to these mutations, or to the native patient * to insu in. Conversely, the insulin-resisUnt 

protein, e.g. by immunization with cells expressing PC-1, 20 PfHent requires much less glucose to maintain basa plasma 

immunization with liposomes containing PC-1 , etc. Such & lucose lev f * ^ e of 1D f ^ on "jetabohsm can 

antibodies are useful in therapy and diagnosis. be ass f sed in tne absence ° f ! he confounding effects of 

, ., ,. , . , . , , hypoglycemic counterregulation, endogenous insulin 

Antibodies are prepared in accordance with conventional Qf vafiab , e h ^ \ mu i tip i e 

ways, where the expressed polypeptide or protein is used as msulin actioQS caQ be ^ b ^ • mdudin 

an immunogen, by itsetf or conjugated to known immuno- ]&xion of lucosc takc and productioD) inhibition <$ 

gemc earners, e.g. KLH, pre -S HBsAg other viral or B , ^ ^ ch m b mctabolism- 

eukaryotic protems, or the like. Vanous adjuvants may be . „ . . . . , , ... , 

, . ... . .... . t ' An alternative is the minimal model. With this procedure, 

employed, with a senes of injections, as appropriate. For . . . .. . . f . • . 

monoclonal antibodies, after one or more booster injections, P ucos * and H msulm ™ \ *?" "f^?" 

the spleen is isolated, the splenocytes immortalized, and 30 "g^ter during an , intravenous glucose ( tolerance test; he 

then screened for high affinity antibody binding. Tne immor- reSU * a ' e t eQtered f° % COm P U " v / ^!S?T 

talized cells, e.g. hybridomas, producing the desired anti- a V f e *** * ™ indc ? ^ Ued Sl > ™ e 

bodies may then be expanded. For further description, see aCUte ^ ^ ,n S^ose is also 

Monoclonal Antibodies: A Laboratory Manual, Harlow and dete « nined b ? the * est - ™» measure of insulin resistance 

Lane eds., Cold Spring Harbor Laboratories, Cold Spring 35 c ^ elales rea f° P* 1 * ™. U Wltb lhe mic msulm 

Harbor, N.Y., 1988. If desired, the mRNA encoding the ^P?, ^ondiabetic subjects Its accuracy deteriorates in 

heavy and light chains may be isolated and mutagenized by diabc CS bec T? 1 Ulc im ^ ediate P la ^ ma msu i m rcs ^. DSe l ° 

cloning in E coli, and the heavy and light chains mixed to the * lucose challe ^ " dunm f hed - ™ e f ef ° re ' addltl ° oal 

further enhance the affinity of the antibody. Alternatives to maneU ^ are ne f ded 10 raise P^a insulin levels, such as 

in vivo immunization as a method of raising antibodies 40 g'V>ng tolbutamide or exogenous insulin in the course of the 

include binding to phage display libraries, usually in con- es ^ . , , 

junction with in vitro affinity maturation. ^ most P ractlcal wa y assessing insuhn resistance is 

the homeostasis model assessment (HOMAIR), involving 

Phenotypic Indications fasting insulin and glucose levels. This value is calculated as 

45 fasting plasma insulin (u/ml)xfasling plasma glucose 

Insuhn resistance is an essential feature of a great variety (mmol/L)/22.5 (Matthews et al. (1985) Diabetologia. 

of clinical disorders in addition to diabetes, including coro- 28:412-9). The steady-state basal plasma glucose and insu- 

nary artery disease, hyperlipidemia, obesity and hyperten- Jj n concentrations are determined by their interaction in a 

sion. Individuals with non-insulin dependent diabetes have feedback loop. A computer-solved model is been used to 

insulin resistance in peripheral tissues. They have a subnor- 50 predict the homeostatic concentrations which arise from 

mal glucose utilization in skeletal muscle, where glucose varying degrees beta-cell deficiency and insulin resistance, 

transport across the cell membrane of skeletal muscle is the Comparison of a patient's fasting values with the model's 

rate limiting step in glucose metabolism. In adipose and predictions allows a quantitative assessment of the contri- 

muscle cells, insulin stimulates a rapid and dramatic butions of insulin resistance and deficient beta-cell function 

increase in glucose uptake, primarily by promoting the 55 to the fasting hyperglycaemia. The estimate of insuhn resis- 

redistribution of the GLUT4 glucose transporter from its t anC e obtained by homeostasis model assessment correlates 

intracellular storage site to the plasma membrane. Impaired with estimates obtained by use of the euglycaemic clamp, 

glucose tolerance (IGT) is associated with a normal fasting the fasting insulin concentration, and the hyperglycaemic 

blood glucose but an elevated postprandial blood sugar clamp. The lower limit of the top quintile of HOMA(IR) 

between 7.8 and 11 mmol/L (140 and 199 mg/dL). Some 50 distribution (i.e. 2.77) in nonobese subjects with no meta- 

patients with IGT are hyperinsulinimic, and progress to bolic disorders has been chosen as the threshold for insulin 

MODM- resistance in some studies (Bonora et al. (1998) Diabetes 

The response to insulin has been measured by a number 47:1643-9). The results of this study documented that 1) in 

of different methods, and insuhn resistance has been quan- hypertriglyceridemia and a low HDL cholesterol state, insu- 

tified by a number of different indices. A variety of proce- 65 hn resistance is as common as in NIDDM, whereas it is less 

dures have been developed to detect the presence of insuhn frequent in hypercholesterolemia, hyperuricemia, and 

resistance. Using any of these techniques, there is a wide hypertension; 2) the vast majority of subjects with multiple 
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metabolic disorders are insulin resistant; 3) in isolated phycoerythrin, allopbycocyanin, 6-carboxyfluorescein 

hypercholesterolemia, hyperuricemia, or hypertension, insu- ( 6 - FAM ) , 2\7'-dimethoxy-4\5'-dichloro-6- 

lin resistance is not more frequent than can be expected by carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 

chance alone; and 4) in the general population, insulin e-carboxy^'^'.T'^J-hexachlorofluorescein (HEX), 

resistance can be found even in the absence of any major 5 5-carboxyfluorescein (5 -FAM) or N,N,N',NMetramethyl-6- 

metabolic disorders. carboxyrhcxiamine (TAMRA), radioactive labels, e.g. 32 p, 

The measurement of insulin concentration can be done in 35 S» 3 Hi etc - The la bel ma y be a two stage system, where the 

the overnight fasted condition, since in the postprandial amplified DNA is conjugated to biotin, haptens, etc. having 

stale, glucose levels are changing rapidly and the variable a m S a affinity binding partner, e.g. avidin, specific 

levels of glucose confound the simultaneous measure of 10 antibodies, etc., where the binding partner is conjugated to 

insulin levels as an index of insulin action. There is a a detectable label. The label may be conjugated to one or 

significant correlation between fasting insulin levels and both of the primers. Alternatively, the pool of nucleotides 

insulin action as measured by the clamp technique. Very used m tne amplification is labeled, so as to incorporate the 

high plasma insulin values in the setting of normal glucose ' aDe ^ ml ° tne amplification product, 

levels are very likely to reflect insulin resistance. As indi- 15 ^ nudeic ^ genomic DNA , amplifica- 

viduals develop diabetes, plasma glucose increases and tion producl or doned fragrnent> is analyzed by one of a 

plasma insulin decreases and so the plasma insulin level no num5er of melnods ^ovm in the art. The nucleic acid may 

longer reflects only insulin resistance because it becomes be sequenced D y dideoxy or other methods, and the sequence 

influenced by the appearance of a B-cell defect and hyper- of bases compared to a wi i d . typ e PC -1 sequence. Hybrid- 

glycemia. ization with the variant sequence may also be used to 

Detection of PC-1 Associated Insulin Resistance ^T^' [}* P rc « nCC ' f by South f ni ^ d0t WotS ' e!c - Thc 

hybridization pattern of a control and vanant sequence to an 

DNA from a patient having insulin resistance, as array of oligonucleotide probes immobilised on a solid 

described above, suspected of association with aberrant support, as described in U.S. Pat. No. 5,445,934, or in 

PC-1 is analyzed for the presence of an IR polymorphism. WO95/35505, may also be used as a means of detecting the 

Genetic characterization analyzes DNA or RNA, from any presence of variant sequences. Single strand conformational 

source, e.g. skin, cheek scrapings, blood samples, etc. The polymorphism (SSCP) analysis, denaturing gradient gel 

nucleic acids are screened for the presence of an insulin electrophoresis (DGGE), and heteroduplex analysis in gel 

resistant polymorphism, e.g. SEQ ID NO:3, as compared to matrices are used to detect conformational changes created 

a normal sequence (SEQ ID NO:l, SEQ ID NO:2). by DNA sequence variation as alterations in electrophoretic 

A number of methods are available for analyzing nucleic mobility, 

acids for the presence or absence of a specific sequence. Alternatively, where a polymorphism creates or destroys 

Where large amounts of DNA are available, genomic DNA a recognition site f or a restriction endonuclease, the sample 

is used directly. Analysis of genomic DNA may use whole 35 ^ digested ^ that endomJC lease, aod the products ^ 

chromosomes or fractionated DNA, e.g Southern blots etc. f ract ionated to determine whether the fragment was 

Combative Genomic Hybridization (CGH) as described digestcd Fractionation is per f 0 nned by gel or capillary 

in U.S. Pat. No. 5,665,549 provdes methods for determin- electrophoresis, particularly acrylamide or agarose gels. The 

ing the relative number of copies of a genomic sequence. [Q] p(M allde has an Avail site tnat k not Qt m thc [K] 

The intensity of the signals from each labeled subject ^ PC .l aUele, and this difference may be exploited for genetic 

nucleic acid and/or the differences in the ratios between screening 
different signals from the labeled subject nucleic acid 

sequences are compared to determine the relative copy Changes in the promoter or enhancer sequence that may 

numbers of the nucleic acid sequences as a function of affect expression levels of PC-1 can be compared to expres- 

position along the reference chromosome spread. Other 45 sion levels of normal allele by various methods known 

methods for fluorescence in situ hybridization are known in m toe Methods for determining promoter or enhancer 

the art, for a review, see Fox et al. (1995) Clin Chem strength include quantitation of the expressed natural pro- 

41(11): 1554-1559. tein; insertion of the variant control element into a vector 

Alternatively, the region of interest is cloned into a wi,h a rc P orter & ene such as p-galactosidase, luciferase, 
suitable vector and grown in sufficient quantity for analysis. 50 chloramphenicol acetyltransferase, etc. that provides for 
Cells that express PC-1 may be used as a source of mRNA, convenient quantitation; and the like, 
which may be assayed directly or reverse transcribed into Diagnostic screening may also be performed for poly- 
cDNA for analysis. The nucleic acid may be amplified by morpbisms that are genetically linked to a predisposition for 
conventional techniques, such as the polymerase chain reac- pC-1 associated insulin resistance, particularly through the 
tion (PCR), to provide sufficient amounts for analysis. The 55 use D f rnicrosatellite markers, e.g. the variable repeat in 
use of the polymerase chain reaction is described in Saiki, et mu - 0 n 3, or single nucleotide polymorphisms, e.g. the 3' 
al. (1985) Science 239:487, and a review of techniques may UTR polymorphisms. Frequently the rnicrosatellite poly- 
be found in Sambrook, et al. Molecular Cloning: A Labo- morphism itself is not phenotypically expressed, but is 
ratory Manual, CSH Press 1989, pp. 14. 2-14. 33. linked to sequences that result in a disease predisposition. 
Alternatively, various methods are known in the art that go However, in some cases the rnicrosatellite sequence itself 
utilize oligonucleotide ligation as a means of detecting may affect gene expression. Microsatellite linkage analysis 
polymorphisms, for examples see Riley et al. (1990) NA.R. may be performed alone, or in combination with direct 
18:2887-2890; and Delahunty et al. (1996)4m. /. Hum. detection of polymorphisms, as described above. The use of 
Genet. 58:1239-1246. microsatellite markers for genotyping is well documented. 

A detectable label may be included in an amplification 65 For examples, see Mansfield et al. (1994) Genomics 

reaction. Suitable labels include fluorochromes, e.g. fluo- 24:225-233; Ziegle et al. (1992) Genomics 14:1026-1031; 

rescein isothiocyanate (FITC), rhodamine, Texas Red, Dib et al., supra. 



US 6,465,185 Bl 

11 12 

Microsalellite loci thai are useful in the subject methods lymph, dialysis fluid and the like; organ or tissue culture 

have the general formula: derived fluids; and fluids extracted from physiological tis- 
sues. Also included in the term are derivatives and fractions 

u w*W3 0 f suc0 fluids. Th e ceils may be dissociated, in the case of 

where U and U' are non-repetitive flanking sequences that 5 so 1 " 1 tissues, or tissue sections may be analyzed. Alteraa- 

uniquely identify the particular locus, R is a repeat molif, tivel y a of the celk ma y bc prepared, 

and n is the number of repeats. The repeat motif is at least Diagnosis may be performed by a number of methods to 

2 nucleotides in length, up to 7, usually 2-4 nucleotides in determine the absence or presence or altered amounts of 

length. Repeats can be simple or complex. The flanking normal or PCM in patient [cells. For example, detection may 

sequences U and U' uniquely identify the microsalellite io utJl * > stainm S f or .^gical secUons, performed in 

toc\is within the human genome. U and U' are at least about f^T^ T f' "! £T f 

, 0 . ... . , * , , , LJJ uzed 10 stain cytoplasmic molecules. The antibodies of 

18 nucleotides in ength, and may extend several hundred are ^ £ ^ ^ ^ fof 

bases up to about 1 kb on either side of the repeat. Within riod of time sufficiem tQ allo / binding to lhe ilo 

U and U', sequences are selected for amplification primers. usually at , east about 10 mmules ^ ^ £ 

The exact composiUon of the primer sequences are not is labeled with radioisotopes, enzymes, fluoresces, 

critical to the invention, but they must hybridize to the chemiluminescers, or other labels for direct detection. 

flanking sequences U and U', respectively, under stringent Alternatively, a second stage antibody or reagent is used to 

conditions. Criteria for selection of amplification primers are amplify the signal. Such reagents are well known in the art. 

as previously discussed. To maximize the resolution of size For example, the primary antibody may be conjugated to 

differences at the locus, it is preferable to chose a primer 20 biotin, with horseradish peroxidase-conjugated avidin added 

sequence that is close to the repeat sequence, such that the as a second stage reagent. Alternatively, the secondary 

total amplification product is between 100-500 nucleotides antibody conjugated to a fiourescent compound, e.g. 

in length. fiourescein, rhodamine, Texas red, etc. Final detection uses 

The number of repeats at a specific locus, n, is polymor- a substrate that undergoes a color change in the presence of 

phic in a population, thereby generating individual differ- 25 lhe peroxidase. The absence or presence of antibody binding 

cnces in the length of DNA that lies between the amplifi- mav De determined by various methods, including flow 

cation primers. The number will vary from at least 1 repeat cytometry of dissociated cells, microscopy, radiography, 

to as many as about 100 repeats or more. scintillation counting, etc. 

The primers are used to amplify the region of genomic An alternative method for diagnosis depends on the in 

DNA that contains the repeals. Conveniently, a detectable 30 d ^ oa of b ^ b ^ tween ******* and polymor- 

, •„ . ■ 1.^,1 ,„ t , phic PC-1" in a lysate. Measuring the concentration of 

label wdl be mcluded m the amphfication reaction, as fc^* bin( ^ ^ > k fr * { 

previously described. Mul iplex amplification may be per- accomplished by a variety of specific assays. A conventLal 

formed in which several sets of primers are combined in the sandwich t be ^ Fof ^ { & sandwich 

same reaction tube. This is particularly advantageous when assay may fil5l attach pc .^ dfic J [ibodics t0 ^ 

limited amounts of sample DNA are available for analysis. 35 insoluble surface or support. Patient sample lysates are then 

Conveniently, each of the sets of primers is labeled with a added to the supports (for example, separate wells of a 

different fiuorochrome. microtiter plate) containing antibodies. Preferably, a series 

After amplification, the products arc size fractionated. of standards, containing known concentrations of normal 

Fractionation may be performed by gel electrophoresis, and/or PC-1* is assayed in parallel with the samples or 

particularly denaturing acrylamide or agarose gels. A con- 40 aliquots thereof to serve as controls. The quantitation may 

venient system uses denaturing polyacrylamide gels in com- then be performed by adding a labeled antibody specific for 

bination with an automated DNA sequencer, see Hunkapillar PC-1* Other immunoassays are known in the art and may 

et al. (1991) Science 254:59-74. The automated sequencer is nQ d use as diagnostics. Ouchterlony plates provide a simple 

particularly useful with multiplex amplification or pooled determination of antibody binding. Western blots may be 

products of separate PCR reactions. Capillary electrophore- 45 performed on protein gels or protein spots on filters, using 

sis may also be used for fractionation. A review of capillary a detection system specific for PC-1 as desired, conveniently 

electrophoresis may be found in Landers, et al. (1993) using a labeling method as described for the sandwich assay. 

BioTechniques 14:98-111. The size of the amplification Immunoassays may also be used in the detection of 

product is proportional to the number of repeats (n) that are 10 the , P lasma °[ a P*« nt ' quantitative 

presentatthelocusspecmedby the primers Thesize will be 50 f ^^^^^ Panned. It « found that 

polymorphic in the population, and is therefore an allelic decreased levels of PC-1 in the plasma are associated with 

Ir f ht l increased levels in the muscle, therefore a relatively low titer 

mancer lor mat locus. . is associated with insulin resistance. In addition, the soluble 

Screening for polymorphisms in PC-1 may be based on pc.j may be ana i yzed for the presence or a predisposing 
the functional or antigenic characteristics of the protein. polymorphism, e.g. that Q121 protein 
Protein truncation assays are useful in detecting deletions 55 A ^ may ^ prov ided for practice of the subject diag- 
that may affect the biological activity of the protein. Various D0Stic methods Such a kit mntaia hybridizatioD rob ° s 
immunoassays designed to detect polymorphisms in PC-1 that bind to a polymorphic P(M * sequence under hybrid- 
proteins may be used m screening. Where many diverse conditions where ±c pro5e does nol bind to a ^ 
genetic mutations lead to a particular disease pbenotype, t seqfm0m Alternatively, antibodies specific for a 
functional protein assays have proven to be effective screen- 60 polymorphic PC-1* polypeptide may be included. Such a kit 
mg tools, for example by detecting the specific phosphatase ^ t icalI mdude itivc Md ivc nuckic add 0f 
activity on a PC-1 substrate. Alternatively, changes in elec- polypeptide controls, and such other buffers and reagents as 
trophoreuc mobility may be used be oe tQ ice ^ memod 

Antibodies specific for an PC-1 polymorphism may be 

used in staining or in immunoassays. Samples, as used 65 Modulation of Gene Expression 

herein, include cells, e.g. biopsy samples, biological fluids The PC-1 genes, gene fragments, or the encoded protein 

such as semen, blood, cerebrospinal fluid, tears, saliva, or protein fragments are useful in gene therapy to treat 
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disorders associated with PC-1 insulin resistance. Expres- intracellular stability and binding affinity. Among useful 

sion vectors may be used to introduce a PC-1 gene into a changes in the backbone chemistry are phosphorothioates; 

cell. Such vectors generally have convenient restriction sites phosphorodithioates, where both of the non-bridging oxy- 

located near the promoter sequence to provide for the gens are substituted with sulfur; pbosphoroamidites; alkyl 

insertion of nucleic acid sequences. Transcription cassettes 5 pbospbotriesters and boranophosphates. Achiral phosphate 

may be prepared comprising a transcription initiation region, derivatives include 3'-0*-5<-S-phosphorothioate, 3'-S-5'-0- 

the target gene or fragment thereof, and a transcriptional phosphorothioate, 3'-CH2-5'-0-phosphonate and 3'-NH-5'- 

termination region. The transcription cassettes may be intro- O-phosphoroamidate. Peptide nucleic acids replace the 

duced into a variety of vectors, e.g. plasmid; retrovirus, e.g. entirc ribose phosphodiester backbone with a peptide link- 

lenti virus; adenovirus; and the like, where the vectors are 10 a ge. 

able to transiently or stably be maintained in the cells, Sugar modifications are also used to enhance stabibty and 

usually for a period of at least about one day, more usually affinity. The a-anomer of deoxyribose may be used, where 

for a period of at least about several days to several weeks. the base is inverted with respect to the natural b-anomer. The 

The gene or PC-1 protein may be introduced into tissues 2 '-°H of J?»" ™& u ma >\ ■* ***** lo form 2 '°- 

or host cells by any number of routes, including viral 15 methyl or 2-O-allyl sugars, which provides resistance to 

infection, microinjection, or fusion of vesicles. Jet injection degradation without comprising affinity, 

may also be used for intramuscular administration, as Modification of the heterocyclic bases must maintain 

described by Furth et al. (1992) Anal Biochem 205:365-368. P ro P er base pairing. Some useful substitutions include deox- 

The DNA may be coated onto gold microparticles, and yuridine for deoxy thymidine; S-methyl-J-deoxycytidine 

delivered intradermal^ by a particle bombardment device, 20 and 5-bromo-2'-deoxycytidine for deoxycytidine. 

or "gene gun" as described in the literature (see, for 5-propynyl-2 , -deoxyuridine and 5-propynyl-2'- 

example, Tang et al. (1992) Nature 356:152-154), where deoxycytidine have been shown to increase affinity and 

gold microprojectiles are coated with PC-1 protein or biological activity when substituted for deoxy thymidine and 

nucleic acids encoding PC-1, then bombarded into skin deoxycytidine, respectively. 

cells. 25 The antisense molecules and/or other inhibitory agents are 
Antiscnsc molecules can be used to down-regulate administered by contact with cells under conditions that 
expression of PC-1 in cells. The anti-sense reagent may be P ermit entr y- The molecules may be provided in solution or 
antisense oligonucleotides (ODN), particularly synthetic in an y other pharmacologically suitable form for 
ODN having chemical modifications from native nucleic administration, such as a liposome suspension. There are 
acids, or nucleic acid constructs that express such anti-sense 30 man y de hvery methods known in the art for enhancing the 
molecules as RNA The antisense sequence is complemen- u P lake of nucleic acids b X cells - Useful deliver y s y slems 
tary to the mRNA of the targeted gene, and inhibits expres- mdude Sendai virus-hposome delivery systems (see Rapa- 
sion of the targeted gene products. Antisense molecules P orl aod Shai < 1994 ) J BU>1 Chem - 269:15124-15131), 
inhibit gene expression through various mechanisms, e.g. by cationic liposomes, polymenc delivery gels or matrices, 
reducing the amount of mRNA available for translation, 35 P orous baUoon cath eters (as disclosed by Shi et al. (1994) 
through activation of RNAse H, or steric hindrance. One or Circulation 90:955-951; and Shi et al. (1994) Gene Therapy 
a combination of antisense molecules may be administered, 1:408-414), retrovirus expression vectors, and the like, 
where a combination may comprise multiple different The use of Uposomes as a delivery vehicle is one method 
sequences. °f interest. The liposomes fuse with the cells of the target 
Alternatively, the antisense molecule is a synthetic ofigo- 40 f ite and deliver the contem f of the lumen ^acejlularly. The 
nucleotide. Antisense oligonucleotides will generally be at h P^° me& . are ^fained in contact with the cells for 
least about 7, usually at least about 12, more usually at least sufficient tm ? e for using various means to maintain 
about 20 nucleotides in length, and not more than about 500, J° ntact ' such ablation binding agents, and the like, 
usually not more than about 50, more usually not more than A < ^P osomes ™y be prepared with purified proteins or pep- 
about 35 nucleotides in length, where the length is governed 45 tldej 4 hat mediate ftlslon £ f membranes, such as Sendai virus 
by efficiency of inhibition, specificity, including absence of ? T virus, etc. The lipids may be any useful com- 
cross-reactivity, and the like. It has been found that short bmatl0D ° f knt ™ n Imosome forming lipids, including cat- 
oligonucleotides, of from 7 to 8 bases in length, can be ! OI " c such 1 «* phosphatidylcholine The remaining 
strong and selective inhibitors of gene expression (see 5n h f ld ^(normally be neutral lipids, such as cholesterol, 
Wagner et al. (1996) Nature Biotechnology 14:840^44). 50 Phosphatidyl senne, phosphatidyl glycerol, and the like. 

. . c . . » , The therapeutic agents are administered at a dose effective 

«rl^A K jr.„« 't^Zln L l 10 redu « expression tevel of PC-1* at leas. abou. 50%. more 

strand mRNA sequence is chosen to be complemented by ^ / preferably lo substantially unde- 

the antisense sequence, preferably encompassing the [Q121] tectable levels 

PC-1 mutation. Selection of a specific sequence for the 55 

oligonucleotide may use an empirical method, where several Genetically Modified Cells and Animals 

candidate sequences are assayed for inhibition of expression The subject nucleic acids can be used to generate trans- 

of the target gene in an in vitro or animal model. A gemc animals or site specific gene modifications in cell 

combination of sequences may also be used, where several ii nes . Transgenic animals may be made through homologous 

regions of the mRNA sequence are selected for antisense m recombination. Alternatively, a nucleic acid construct is 

complementation. randomly integrated into the genome. Vectors for stable 

Nucleic acids may be naturally occurring, e.g. DNA or integration include plasmids, retroviruses and other animal 

RNA or may be synthetic analogs, as known in the art. Such viruses, YACs, and the like. The modified cells or animals 

analogs may be preferred for use as probes because of are useful in the study of PC-1 function and regulation. A 

superior stability under assay conditions. Modifications in 65 detectable marker, such as lac Z may be introduced into the 

the native structure, including alterations in the backbone, PC-1 locus, where upregulation of PC-1 expression will 

sugars or heterocyclic bases, have been shown to increase result in an easily detected change in phenotype. 
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DNA constructs for homologous recombination will com- interaction with proteins, particularly hydrogen bonding, 

prise at least a portion of a polymorphic PC-1* gene with the and typically include at least an amine, carbonyl, hydroxyl 

desired genetic modification, and will include regions of or carboxyl group, preferably at least two of the functional 

homology to the target locus. DNA constructs for random chemical groups. The candidate agents often comprise cycli- 

integration need not include regions of homology to mediate 5 cal carbon or heterocyclic structures and/or aromatic or 

recombination. Conveniently, markers for positive and polyaromatic structures substituted with one or more of the 

negative selection are included. Methods for generating cells above functional groups. Candidate agents are also found 

having targeted gene modifications through homologous among biomolecules including peptides, saccharides, fatty 

recombination are known in the art. For various techniques acids, steroids, purines, pyrimidines, derivatives, structural 

for transfecting mammalian cells, see Keown et al. (1990) 10 analogs or combinations thereof. 

Methods in Enzymology 185:527-537. Candidate agents are obtained from a wide variety of 
For embryonic stem (ES) cells, an ES cell line may be sources including libraries of synthetic or natural corn- 
employed, or ES cells may be obtained freshly from a host, pounds. For example, numerous means are available for 
e.g. mouse, rat, guinea pig, etc. Such cells are grown on an random and directed synthesis of a wide variety of organic 
appropriate fibroblast-fecder layer or grown in the presence 15 compounds and biomolecules, including expression of ran- 
of leukemia inhibiting factor (LIF). When ES cells have domized oligonucleotides and oligopeptides. Alternatively, 
been transformed, they may be used to produce transgenic libraries of natural compounds in the form of bacterial, 
animals. After transformation, the cells are plated onto a fungal, plant and animal extracts are available or readily 
feeder layer in an appropriate medium. Cells containing the produced. Additionally, natural or synthetically produced 
construct may be detected by employing a selective 20 libraries and compounds are readily modified through con- 
medium. After sufficient time for colonies to grow, they are ventional chemical, physical and biochemical means, and 
picked and analyzed for the occurrence of homologous may be used to produce combinatorial libraries. Known 
recombination or integration of the construct. Those colo- pharmacological agents may be subjected to directed or 
nies that are positive may then be used for embryo manipu- random chemical modifications, such as acylation, 
lation and blastocyst injection. Blastocysts are obtained 25 alkylation, esterification, amidification, etc. to produce 
from 4 to 6 week old supcrovulatcd females. The ES cells structural analogs. 

are trypsinized, and the modified cells are injected into the where the screening assay is a binding assay, one or more 
blastocoel of the blastocyst. After injection, the blastocysts 0 f foe molecules may be joined to a label, where the label 
are returned to each uterine horn of pseudopregnant females. can directly or indirectly provide a detectable signal. Various 
Females are then allowed to go to term and the resulting 3° , abel& indude radioisotopes> fluoresces, chemiluminescers, 
Utters screened for mutant cells having the construct. By enzymes, specific binding molecules, particles, e.g. mag- 
providing for a different phenotype of the blastocyst and the net j c particles, and the like. Specific binding molecules 
ES cells, chimeric progeny can be readily detected. incUK i e pairs> sucn ^ biolin and strep tavidin, digoxin and 
The chimeric animals are screened for the presence of the antidigoxin etc. For the specific binding members, the 
modified gene and males and females having the modifica- 35 complementary member would normally be labeled with a 
tion are mated to produce homozygous progeny. The trans- molecule that provides for detection, in accordance with 
genie animals may be any non-human mammal, such as known procedures. 

laboratory animals, domestic animals, etc. The transgenic a variety of other reagents may be included in the 

animals may be used in functional studies, drug screening, screening assay. These include reagents like salts, neutral 

etc., e.g. to determine the effect of a candidate drug on proteins, e.g. albumin, detergents, etc that are used to 

insulin resistance. facilitate optimal protein -protein binding and/or reduce non- 

tx o A specific or background interactions. Reagents that improve 

» b j me efficiency of the assay, such as protease inhibitors, 

Drug screening identifies agents inhibit or otherwise 45 nuclease inhibitors, anti-microbial agents, etc. may be used, 

modulate PC-1 function in cells. Of particular interest are The mixture of components are added in any order that 

screening assays for agents that have a low toxicity for provides for the requisite binding. Incubations are per- 

human cells. A wide variety of assays may be used for this formed at any suitable temperature, typically between 4 and 

purpose, including labeled in vitro protein-protein binding 40° C. Incubation periods are selected for optimum activity, 

assays, eleclrophoretic mobility shift assays, immunoassays 50 but may also be optimized to facilitate rapid high-throughput 

for protein binding, and the like. The purified protein may screening. Typically between 0.1 and 1 hours will be suffi- 

also be used for determination of three-dimensional crystal cient. 

structure, which can be used for modeling intermolecular The compounds having the desired pharmacological 

interactions, transporter function, etc. activity may be administered in a physiologically acceptable 

The term "agent" as used herein describes any molecule, 55 carrier to a host for treatment of insulin resistance or 

e.g. protein or pharmaceutical, with the capability of altering hyperglycemia attributable to a defect in PC-1 function. The 

or mimicking the physiological function of PC-1. Generally compounds may also be used to inhibit PC-1 function in 

a plurality of assay mixtures are run in parallel with different resistance to insulin, etc. The inhibitory agents may be 

agent concentrations to obtain a differential response to the administered in a variety of ways, orally, topically, parenter- 

various concentrations. Typically, one of these concentra- 60 au y e.g. subcutaneously, intraperitoneally, by viral infection, 

tions serves as a negative control, i.e. at zero concentration intravascularly, etc. Topical treatments are of particular 

or below the level of detection. interest. Depending upon the manner of introduction, the 

Candidate agents encompass numerous chemical classes, compounds may be formulated in a variety of ways. The 

though typically they are organic molecules, preferably concentration of therapeutically active compound in the 

small organic compounds having a molecular weight of 65 formulation may vary from about 0.1-100 wt. %. 

more than 50 and less than about 2,500 daltons. Candidate The pharmaceutical compositions can be prepared in 

agents comprise functional groups necessary for structural various forms, such as granules, tablets, pills, suppositories, 
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capsules, suspensions, salves, lotions and the like. Pharma- 
ceutical grade organic or inorganic carriers and/or diluents 
suitable for oral and topical use can be used to make up 
compositions containing the iherapeutically-active com- 
pounds. Diluents known to the art include aqueous media, 5 
vegetable and animal oils and fats. Stabilizing agents, wet- 
ting and emulsifying agents, salts for varying the osmotic 
pressure or buffers for securing an adequate pH value, and 
skin penetration enhancers can be used as auxiliary agents. 

The following examples are put forth so as to provide m 
those of ordinary skill in the art with a complete disclosure 
and description of how to make and use the subject 
invention, and are not intended to limit the scope of what is 
regarded as the invention. Efforts have been made to ensure 
accuracy with respect to the numbers used (e.g. amounts, 15 
temperature, concentrations, etc.) but some experimental 
errors and deviations should be allowed for. Unless other- 
wise indicated, parts are parts by weight, molecular weight 
is average molecular weight, temperature is in degrees 
centigrade; and pressure is at or near atmospheric. 20 

EXPERIMENTAL 

EXAMPLE 1 

Polymorphic Variant of PC-1 Associated with 25 
Insulin Resistance 

Methods 
Subjects 

127 unrelated, healthy, non obese subjects (body mass 
index, BMI, <30 Kg/m 2 ) normotensive (blood 30 
pressure< 140/90 mm Hg), normal glucose tolerant (by 
OGTT) were studied. Plasma insulin levels were measured 
before and during an OGTT that was carried out after 8 days 
on a weight-maintaining diet. Insulin stimulated glucose 
disposal was carried out in a subgroup of 71 subjects by the 35 
euglycaemic, hyperinsulinemic clamp. 

Also studied were 132 type 2 diabetic patients (age- 
66.5±8.0 yr, 60 male/72 female, BMI -28 .9*4.5 Kg/m 2 ) with 
a strong family history of diabetes (one first degree relative 
with type 2 diabetes at the minimum). To minimize the 40 
possible inclusion of individuals affected by late onset type 
1 diabetes, patients were selected on the basis of age of 
diabetes onset^45 yrs, BMI^21 Kg/m 2 and no need for 
insulin therapy. 

Informed consent was obtained from participants before 45 
entry into the study, which was approved by the local 
research ethics committee. 
Polymorphism Screening 

Overlapping cosmid clones containing the PC-1 gene 
were isolated by screening a human chromosome 6 specific 50 
genomic library with human full length PC-1 cDNA. 
Cosmids were digested with, different four base cutter 
restriction enzymes, blotted and hybridized to oligonucle- 
otides designed on the cDNA sequence. Positive fragments 
were cloned and automatically sequenced. Intron-exon junc- 55 
tions were deduced comparing genomic and cDNA 
sequences. 

All exon amplimers, obtained using specific oligonucle- 
otides as primers, were analyzed in 40 unrelated and 
unscreened individuals by Single Strand Conformation 60 
Polymorphism (SSCP) which was performed as follows. 
Amplification reaction products were denatured for 5 min- 
utes at 87° C. in 90% formamide, 20 mM EDTA, 10 mM 
NaOH. After denaturation samples were chilled on ice, 
loaded on a native 8%-12% (according to amplimer size) 65 
acrylamide (29:1 Acrylamide-Bisacrylamide) gel (0.04x20x 
42 cm) in TBE and electrophoresed at 10 W constant power 



,185 Bl 

18 

for 13-16 hours at room temperature. After the 
electrophoresis, gels were treated by silver staining. PCR 
products showing different migration patterns at SSCP were 
cloned in a TA-cloning vector (Stratagene) and four clones 
for each sample were automatically sequenced from both 
ends. 

Exon 4 amplimers were obtained using oligonucleotides 
4Fw [SEQ ID NO:9] (5'-ctgtgttcactttggacatgttg-3') and 4Rv 
[SEQ ID NO: 10] (5'gacgttggaagataccaggUg-3') as primers. 
PCR products were digested using Avail restriction enzyme 
and run on 12% native polyacrylamide gel for 2 hours at 
500V. After the electrophoresis, gels were stained by silver 
nitrate. On the gel K alleles are displayed as single, uncut, 
bands of 238 bp, while Q alleles are shown as a doublet of 
148 and 90 bp. 

One -hundred-sixty unscreened blood donors were geno- 
typed as background population. All genotypings were per- 
formed in duplicate for each individual and to prevent 
observer bias the investigator was unaware of sample origin. 
Skin Fibroblast Culture and Insulin Receptor Autophospho- 
rylation 

Fibroblast cultures were established from 4-mm forearm 
skin-punch biopsies. I 125 insulin binding data were obtained 
by inhibition-competition studies. IR-TK (receptor 
autophosphorylation) was determined exposing cells for 10 
min to increasing insulin (0-100 nM) concentrations. Cells 
were then solubilized in 50 mM Hepes buffer, pH 7.6, 
containing 1% Triton X-100, 1 mM PMSF, 2 mM ortho- 
vanadate and 1% BSA and IR-TK determined. 
PC-1 Content in Muscle Tissue Specimens 

Muscle tissue specimens were obtained from the external 
oblique muscle at elective abdominal surgery 
(cholecystectomy) and were immediately frozen in liquid 
nitrogen. Soluble extracts were prepared from frozen muscle 
tissue and PC-1 content was measured by a specific ELISA 
as previously described and normalized for protein content. 
Statistical Analysis 

Group values are given as mcan±SD. Student's t-test or 
Mann Whitney U test were used to compare mean values of 
2 groups. One-way ANOVA and both Student-Newman- 
Keuls and Bonferroni t-test were used to compare mean 
values of more than 2 groups. Two-way ANOVA test was 
used to compare insulin dose-response curves of IR-TK. 
Chi-square test was used to compare allele frequency. 
Results 

The PC-1 gene has been located on chromosome 
6q22-23. Analysis of a YAC contig from the region, allowed 
it to be more finely mapped, to between markers D6S457 
and WI-3398. Only exon 4, which extends from nucleotide 
447 to 571 of the cDNA and codes for an extracellular 
portion of PC-1, showed a polymorphic variant. When 
screened by SSCP analysis and sequencing it revealed a 
frequent first position A-*C transversion at codon 121 
(considering the second in frame ATG as the start codon) 
(FIG. la). This single base change substitutes a glutamine 
for a lysine in a cysteine-rich region of PC-1 (SEQ ID NO:l 
and SEQ ID NO:3, respectively), and creates an Avail 
restriction enzyme recognition site. Avail digestion of exon 
4 amplimers cuts the Q allele PCR fragments, leaving the K 
allele undigested (FIG. lb). In 160 uncharacterized blood 
donors, the Q allele frequency was 17.5%, with only 2 QQ 
homozygotes. The observed genotype frequencies were in 
agreement with those predicted by the Hardy-Weimberg 
equilibrium. 

Having identified a PC-1 polymorphism which changes 
both amino acid composition and electric charge, and thus 
with potential biological relevance, we searched for an 
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association with insulin resistance. Accordingly we studied 
127 unrelated, healthy, non obese, normolensive, non dia- 
betic subjects resident in Sicily. As expected, these individu- 
als showed a wide range of plasma insulin levels during 
OGTT, a finding which in the presence of normal glucose 5 
tolerance, indicates a wide range of insulin sensitivity. These 
data were confirmed by the euglycemic hyperinsulinemic 
glucose clamp, a more quantitative technique for the mea- 
surement of insulin sensitivity. In a subgroup of 71 indi- 
viduals the M values for insulin stimulated glucose disposal JQ 
ranged from 2.34 to 9.62 mg/Kg/min. 

Table 1 summarizes the clinical features of these 2 groups. 
Q allele carriers showed higher fasting plasma glucose 
(p<0.001) (Table 1 and FIG. 2a) values. They also showed 
higher plasma insulin values at 60 (p<0.05) and 120 J5 
(p<0.01) minutes during OGTT (FIGS. 2a and 2b). 

TABLE 1 
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TABLE 2 



Q allele frequency and insulin sensitivity (M value) in subjects divided in 

tertiles according to plasma insulin level at 120 min during OGTT 
(1RI 120 min) 

Tertiles 

IR1120 1 2 3 

min range (1353-300 pmole/l) (273-147 pmol/1) (140-27 pmol/1) 



Q allele 29# 

frequency % n - 22 

M value 4.50 * 0.36* 

(mg/kg/min) n - 17 



15 
n-23 
5.46 * 0.30** 
n-20 



n - 22 
7.21 * 0.26 
n-17 





Clinical Characteristics of the subiect studied 






Gender 


Age BMI 


FPG 


FIRI 


Genotype 


(M/F) 


(years) (Kg/m 7 ) 


(mmol/1) 


(pmol/1) 


KK 


27/18 


36.6*21 23.8*0 JS 


4.7 * 0.1 


49.0 * 4.0 


(n-45) 










KQorQQ 


18/4 


403 ±3.1 24.2*0.8 


5.1 *0.1« 


60.0 * 8.0 


(d - 22) 











Data are expressed as mean * SEM. 

*p < 0.01 vs. KK subjects 

BMI - body mass index 

FPG - fasting plasma glucose 

FIRI •» fasting immunoreactive insulin 

In the subjects studied by glucose clamp, insulin stimu- 
lated glucose disposal was lower in Q allele carriers when 
compared to KK allele age, sex and BMI matched subjects. 
No difference was observed in insulin levels at steady state 35 
during clamp studies in the 2 groups (485+165 pmol/1 vs 
460+78). On the average, therefore, Q allele carriers were 
insulin resistant and maintained normal glucose tolerance 
due to compensatory hyperinsulinemia. Mean blood 
pressure, plasma total cholesterol, HDL cholesterol and 40 
triglyceride levels were not different between the 2 groups. 

Of the 2 subjects with QO alleles, one was a 35 yr. old 
male who was studied by euglycemic clamp and bad the 
second lowest M value (M-2.57 mg/Kg/min) of the all the 
XY males studied. The BMI (28 Kg/m2), blood pressure 45 
(138/90 mm Hg), and lipid profile (cholesterol/HDL ratio 
being 0.16 and triglycerides 176 mg/dl) were in the upper 
range of the studied individuals. The second QQ subject was 
a 52 yr. old female with BMI (21 Kg/m2) blood pressure, 
and her lipid profile was entirely normal. She did not agree 50 
to be studied by euglycemic clamp. Both QQ subjects were 
first degree relatives of a type 2 diabetic patient. 

When subjects were subdivided into tertiles according to 
plasma insulin levels at 120 minutes during the OGTT 
(tertile l=low, tertile 2-intermediate, and tertile=3 high 55 
insulin levels). As expected, the mean M value for glucose 
disposal progressively decreased from tertile 1 to tertile 3 
(7.22+0.26 mg/Kg/min, n-21 vs. 5:86+0.28, n-25 and 4.89+ 
0.23, n=25, p<0.001). Q allele frequency was similar in 
subjects from tertiles 1 and 2 (11.7, n-34 and 10.6%, n-33, 60 
respectively), but it was much higher in tertile 3 insulin 
resistant subjects (29.4%, n-34, p<0.01 when compared to 
the remaining 67 subjects, 11.2%). Also, in 133 type 2 
diabetic patients Q allele frequency was higher (20.8%, 
p<0.01) than in tertile 1 and 2 subjects with no difference 65 
between obese (BMI>30 Kg/m2, n-90) and non obese 
(n-42) patients (21.1% and 20.2%, respectively). 



Data arc expressed as mean * SEM. Number of subjects are given in 

parenthesis. 

#p < 0.05 vs tertile 3 

"p < 0.01 vs. tertile 2 and 3 

"p < 0.01 vs. tertile 3 

In order to exclude any association of the Q allele variant 
with other changes in PC-1, each of the 25 exons from a QQ 
control were sequenced from the start to the stop codon. No 
other base change was detected. 

In order to study IR autophosphorylation activity, cultured 
fibroblasts from 5 Q/K and 5 gender, age and BMI matched 
KK subjects were selected on the basis of a similar PC-1 
protein content (50.3+8.7 and 60.8+15.4 ng/0.1 mg protein, 
respectively). Q/K fibroblasts showed a reduced IR auto- 
phosphorylation activity (p<0.01) (FIG. 3). Insulin binding 
to its receptor was studied and no difference in both total 
specific binding (% of bound/total radioactivily-0.52+0.10 
per 0.1 mg protein and 0.55+0.11 in Q/K and KK subjects, 
respectively), and IC50(0. 27+0.05 nmol/1 and 0.26+0.08). 

PC-1 content was not significantly different in muscle 
specimens from 8 QK and 26 KKsex, age and BMI matched 
subjects (36.5±5.1 ng/mg protein vs. 25.9±2.6 in QK and 
KK subjects, respectively. 
Discussion 

The data provided herein demonstrate that a PC-1 gene 
polymorphism (K121Q in exon 4) is associated with 
decreased insulin sensitivity in healthy non-diabetic indi- 
viduals. Because insulin resistance is a major risk factor for 
the development of type 2 diabetes, Q allele carriers may be 
at higher risk to develop diabetes. This is supported by the 
high Q allele frequency observed in patients with type 2 
diabetes mellitus. No association was observed with BMI 
both in healthy and diabetic individuals. 

We previously reported that increased PC-1 content in 
skeletal muscle and adipose tissue is associated with insulin 
resistance. In addition, when cultured cells overexpress 
PC-1 they are insulin resistant secondary to both decreased 
IR tyrosine kinase activity and reduced downstream signal- 
ing steps. These latter observations indicate that an increased 
PC-1 content may play a role in insulin resistance through 
the inhibition of IR-TK activity. 

PC-1 content is not significantly different in skeletal 
muscle from KQ with respect to KK subjects, indicating that 
insulin resistance in Q allele subjects is not due to an 
increased PC-1 protein content. Again, these data suggest 
that structural differences between the 2 variant proteins 
may account for different insulin sensitivity, independent of 
protein content. These data indicate PC-1 is an important 
candidate for the genetic regulation of whole body insulin 
sensitivity. PC-1 genotyping can be used for identifying 
individuals at risk of developing insulin resistance. 

EXAMPLE 2 
Fasting Plasma PC-1 and its Regulation by Insulin 
A soluble form of PC-1 is generated by intracellular 
cleavage of its transmembrane domain, and subsequently 
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released by the cell. It is not known whether soluble PC-1 
circulates in human plasma. The possibility of measuring 
PC-1 in human plasma would considerably increase the 
feasibility of screening studies. 

A sensitive and specific ELISA was set up, and used to 
measure plasma PC-1 concentration before and after a 
2-hour euglycemic hyperinsulinemic clamp in 22 healthy 
control, and 27 subjects affected by diseases known to be 
associated with insulin resistance (i.e. obesity and essential 



Results 

Subjects Studied 

Twenty two healthy control and 27 subjects affected by 
either obesity (BM1>28, n=10) or essential hypertension 
(mean blood pressure>108 mm Hg, n-12) or both (n=5) 
were studied. Clinical and metabolic features of the 49 
subjects are shown in Table 3. As expected, insulin 
sensitivity, as indicated by M values derived by euglycemic 
hyperinsulinemic clamp studies, was significantly reduced 



sex BMI W/H MBP BG IRI M 



SEM 

Insulin Resistant 



12/10 23.8 
0.4 



hypertension). The obtained results indicate that low fasting 10 in obese and/or hypertensive patients as compared to normal 
level and abnormal acute regulation by insulin of plasma controls. 
PC-1 concentration are associated with several features of 
the "metabolic syndrome", including abdominal fat 
distribution, high blood pressure and low insulin sensitivity ~ ~ 
on both glucose and lipid metabolism. 15 

Methods Control 

Plasma PC-1 Measurement 

Wells in Maxisorb plates were precoated overnight incu- 
bation at 40° C. with an affinity purified polyclonal antibody 
to PC-1. After washing with TBST buffer (20 mm Tris, 150 20 
mm NaCl, 0.05% Tween-20) to remove unbound antibody, 
wells were blocked with 150/d TBST containing 1% bovine 
serum albumin (BSA) (30 min at 56° C), and washed again 
with TBST. Then, human plasma (10-30/4 diluted to a total 
volume of 100 fd with 50 mM HEPES buffer, pH 7.6, 25 
containing 0.05% Tween-20, 1 mM PMSF, 2 mM 
orthovanadate, 1% BSA and 1 mg/ml bacitracin) was added 
to each well and PC-1 was allowed to bind overnight at 4° 
C. After extensive washing with TBST, a biotinylated anli- 



47 
2 



29.2 
0.8 



0.83 
0.03 



0.92 
0.02 



109 
3 



5.3 
0.1 



Plasma PC-1 Concentration 

Fasting plasma PC-1 was measured by ELISA. Human 
plasma produced a dilution slope that paralleled the PC-1 
standard. Intra- and inter-assay coefficient of variations 
were<8%. Plasma PC-1 concentration ranged from 1 to 70 
ng/ml with a mean±S>E>of 26 .5 ±2. 9 and a median of 24.5. 
PC-1 monoclonal antibody was added in the 50 mM HEPES 30 No significant difference was observed between plasma 
buffer. After 2 hr at 22° C, peroxidase-streptavidin was PC-1 concentration in control (27.7*4.5, n=22) and insulin 
added and 30 min later, wells were washed again with TBST resistant obese and/or hypertensive (25.6*3.9, n=27) sub- 
and then 100 pi of biotinyl-tyramide solution was added. jecls. 

After 15 min incubation at 22° C, wells were washed with When the 49 subjects were considered together, plasma 
TBST and streptavidin-horseradish peroxidase was added 35 PC-1 concentration was correlated negatively with both 
(30 min at 22° C). After further extensive washing, the waist/hip ratio (-0.49, p=0.001) and systolic blood pressure 

(-0.36, p«=0.016) and positively (0.40, p=0.01) with the 
ability of insulin to suppress plasma FFA (delta FFA, cal- 
culated by subtracting basal FFA from FFA after the two 
40 hour euglycemic hyperinsulinemic clamp). Plasma PC-1 
concentration remained significantly correlated with the 
waist/hip ratio also when data were adjusted for BMI and 
sex (p=0.0024), with systolic blood pressure also when data 
were adjusted for sex and age (p-0.019) and with delta FFA 
blood removed, specimens were immediately frozen in 45 also when data were adjusted for BMI, sex and waist/hip 
liquid nitrogen. Soluble extracts were prepared from frozen ratio (p-0.037). 

muscle tissue as previously described. Briefly, muscle tissue These data demonstrate that PC-1 circulates in human 
(approximately 150 mg) was pulverized under liquid nitro- plasma and that low plasma PC-1 level is independently 
gen and then homogenized in 2 ml buffer (50 mm HEPES, associated with several features of the "metabolic syn- 
150 mm NaCl, 2 mm PMSF, pH 7.6) at 4° C. using a 50 drome" including abdominal fat distribution, high blood 
polytron homogenizer for 10 sec. at medium speed. Triton pressure and, so far as lipid metabolism is concerned, insulin 



peroxidase activity was determined calorimetrically by add- 
ing S^.S-S'-tetramethylbcnzidine (TMB) at a concentration 
of 0.4 g/1 in an organic base, and measuring the absorbance 
at 451 nm. 

Muscle PC-1 Measurement 

Muscle tissue specimens were obtained from the external 
oblique muscle at elective abdominal surgery 
(cholecystectomy). After adipose tissue was dissected and 



X-100 was added to a final concentration of 1%, and the 
homogenates solubilized for 60 min at 4° C. The material 
was centrifuged at 100K g for 60 min at 4° C. and the 
supernatants used for the PC-1 content measurement. 
Statistical Analysis 

One way analysis of variance (ANOVA) was utilized 
when means values from 3 groups were compared. Paired 
Student's t test was utilized to compare mean values before 
and after clamp. 

Correlation (either "Pearson" if the data was distributed 
normally or "Spearman" if the data was not distributed 
normally) analysis was used to look for numerical relation- 
ship between values. Statistically significant correlations 



resistance. 
Insulin Stimulated Values 

In order to verify whether insulin exerts any effect on 
plasma PC-1, PC-1 was measured after two hour euglycemic 
hyperinsulinemic clamp. Although the mean plasma PC-1 
concentration in the 49 subject after clamp was not different 
as compared to basal plasma PC-1 level (26.3±3.9, vs. 
26.5±4.1), a wide range of the individual effects of insulin 
infusion were observed, from subjects showing an increase 
to subjects showing either no change or a reduction in 
plasma PC-1. When subjects were divided in tertiles accord- 
ing to their whole body insulin sensitivity on glucose values 
(M values), with the most insulin sensitive in tertile 1 and the 



were confirmed by linear regression analysis. Stepwise 65 most resistant in tertile 3, insulin stimulated PC-1 concen- 
regression analysis was utilized for multiple correlations. trations were significantly higher than basal plasma PC-1 
Data are given as meaniSEM. concentration in subjects from tertile 1 (21.7±5.4 vs. 
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25.8±5.5 before and after clamp, respectively, p=0.01 5, n-1 
6) but not in subjects from tertile 2 (31.7±5.0 vs. 31.9*4.6, 
n.s., n=17) and 3 (25.9*4.7 vs. 20.7±3.7, n=16). Moreover, 
the net effect of insulin on plasma PC-1 concentration (delta 
PC-1, calculated by subtracting basal PC-1 from insulin 5 
stimulated PC-1) was positively correlated with M value 
(0.37, p=0.009) and negatively with BMI (-0.37, p=0.009). 
A significant (p«0.005) positive correlation between M and 
delta PC-1 values was observed also when data was adjusted 
for BMI. A similar correlation between M and delta PC-1 10 
values was observed also when control (0.049, p-0.05, 
n-22) and insulin resistant obese and/or hypertensive (0.48, 
p=0.01, n=27) subjects were considered separately). 

These data demonstrate that insulin infusion is able to 
increase plasma PC-1 concentration in the most insulin 15 
sensitive subjects and that this effect is blunted in subjects 
with lower insulin sensitivity. 
Plasma v. Muscle Tissue PC-1 

In order to verify whether plasma PC-1 concentration was 
related to PC-1 content in skeletal muscle, we quantified 20 
PC-1 in both plasma and biopsied external oblique muscle of 
9 additional subjects. PC-1 concentration in plasma was 
inversely correlated with PC-1 content in muscle (-0.9, 



,185 Bl 

24 

p=0.01). These data are compatible with the possibility that 
the increased PC-1 content previously reported in skeletal 
muscle of insulin resistant subjects is, at least in part, due to 
reduction of PC-1 intracellular degradation, and its subse- 
quent release into extracellular fluids, at the level of skeletal 
muscle tissue. 

EXAMPLE 3 
Intron/Exon Structure of PC-1 

The nucleic acid sequences provided below are the intron- 
exon boundaries for the human PC-1 gene. It contains all the 
intron sequences immediately flanking the PC-1 exons. A 
few bases of the exon 5' and 3' are also provided, which are 
separated by a "-" sign from the intron, and are further in 
bold type. 

The 3' flanking sequence to exon 2 (i.e. intron 3 at its 5' 
end) contains a GT repeat that is polymorphic, and provides 
a marker for genotyping of this locus. The sequences flank- 
ing the boundaries or crossing them are useful for specific 
amplification of the exons. 



Intron Exon borders 
[SEQ ID NO : 11] Exon 1 

NGGGAGGGCGGCGCCGAGCTCCTGCGCTCTCAGCGCACTCAGCACCGGGCACGGA 
[SEQ ID NO: 12] Exon 2 

TGAGCTCC ACCGGGCCGGCGGCCGC TCTAGAACT AGTGGATCATGCC AC TGTACCC TAGCCTGGGT AAC AGAG TA 
AGACACTATCTCTAAAAATAAAAAATAAGATAAAATATTTTTTAAAAA 

CCTACAG- CTATTC ... . ACAAG-GTAATT AGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGT 

GTGTGC AC AGCC TT ATT AAGAATGTGATTG AGGT AAACATTATCTCC TATTCCCAAGGGGTAC 
[SEQ ID NO: 13] Exon 3 

AGATTTTT GC CTTACTTTATT ACCCCATCTGT ATTTTGTAAAGTAGTATTTGAACC TAGTGT ACACCTAACTT AG 

TTGTATTCGTTGATGTTTACTTTGAATTATATAATGATTAGAAACATCTGACTTATCK 

TTAA CCAG-GTAAG 



TGGAGCTT 

[SEQ ID NO: 14] Exon 4 

CGCGGCGGCCGTTCTAGAACTAGTGGATCATACTCACGAAGACAGCAATTC 

TTTG AGAC AT AAAACACAT TTTGCTGATGTTTGTTTCTAG - AACATA . . .GTCAAG- 

GTCAGGTGCTCGTTGGGCT CTGC AGCAACC TGGT 

ATCTTCCAACCTCTTAACGGGGCTNTACATAAGTGTTATCTTTTATATTAAGAKTCATGGCT 
[SEQ ID NO: 15] Exon 5 



GTGTGTTCATTTTATTTTCTTGAAAATATTTTAG-GT GAGAA GCAGG— 

GTAAGATT ATATTC TGAGGTATT AATTTTTTC TTTTTT 

AGAAGT AC AGCATC ATTTTTTTC TTTCCAAATTAAGATGATAAAAAT AAT AAAATC AC TGGTTT ATTAAAC ATTA 
C AGGTTGAGTAT CC TTT ATCC AAAATGTTTGGT ATGAG AACTGTTTrGGATTTTGGAC TTTTTTGG ATTTTGCAA 
TATT 

[SEQ ID HO:16] Exon 6 

CCGCAGCCOGGGGGATCACACAGACCTTAGTGGAAAATCTTCACT 
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-continued 

ATTGGAT ATGTC TTGTCTT TGC TTC IT TAAAC AT T X Trrx TTC TlTT TC ATT AC C C AG - G TTTG . . . AAACTAA- 
GTGAGTAACTTCAGAG 

T TT AC T GC TGGAAT ATC AC CATTTC AG TG AGATT G ACT AGGCAGGCAGT CTTT C TT GG AAAAGT AC TGGCAGAAC 
C T AAC T GT TTCACT AAACT TT TC T AAT GGGCAAAGT AGTTGAACC TT GT GT Ag GGC GC CT T ATC TTT AAT AATG T 
GA 

[SEQ ID NO: 17] Bxon 7 

T AAGAGAAAAAT G AAGTCA TC TTTAAGATTGG ATTTGT ATCCACAGTGTTGCT TTATAATTC ATCCTGAATTTTT 
ATCTGATT AAAATC CCTCC T<XX»TAATTTTTTTT ACffTGATTT AG ACTGCTGT GG T AC CACTGCTAAAT G AGGT A 

AGCC AATT G T CAG ATGT AT TT AAT AAC AAT GT TT AT TTTT TTC CCTT C T AG - AAAAATGT TCACC- 

G T AAGC TC TGCATT TC AAC TT C T AT C T GTTTG AAGAAGTG AGATGGG AT TGT AACATTTT TT GAGGGAAT AGATT 
T AAG AT AAAAGAAAAAC AAC T T ATT TT C CAAT AG GT AG TT AAGT AAGGAAAC C CAGGTTCTXJ ATCTTTGCTCTGC 
CACAAACTAGCTGTGGCT 
(SHQ ID HO: 18] Bxoa 8 

ACTAGATAAAATCTTAAGAGGTTGCGTTTTGCGATTACCTGATTTTTTTGTTO 
ATTCCATGTAGCTTCAGTTATCGGTTTCTTTTTGATGATTTTTT^ 
AACTTGCAT AT AAT CTGT 

TTTATC TTTTTTAG - GGATT . . . . AACCA-GTGAGTTCTTTGTTTTTCTACTAA 

AAT AGTTAATT ATT CTC ATCT ATTTGAATC AG AGTAAAAT AACCAGATTCTCT AGAGCTT TT AAT AAC T GATTTC 
ATTTAG TGTGTC TGTGGCC AT 
[SEQ ID NO: 19] Exon 9 

C TTGTTGACC TT AAAAGCATTCT AGCACTAGAGAGG AATGCATTGGTGTGGTATGAAAAC AT ACTTTCC TAAGAG 

ATGAATGTTGCATGATTTCTTAATTTTCCTTCATTTTCTGCTCCAG^ AATGG -GTATGTG 

AAATGAATTTTT TC TAGGATC TG T AAT AT AGAAC AGCTT ATTCTTATGT AATGTGGTTTT TATTGAATC CTGAGC 
TTTAGC ATTTGAGTGAT AT GT TGGC TGAAAAATG AGAACT GAAGAAC TCTTTCTCAAAGAGTTT AGAT AGATGGT 
AAATGGAC ACT AAAAC TA 
[SEQ ID NO: 20] Exon 10 

TGACATTTTC AATG AAAAAAACT AT AT TTT AC ACCC AAAC AATTCTC AGCCATCTTTTA'] 
ATTTT AG- TTCAGT A . . . . AGATGAAAG- GTCTGTAGGCAATTAATTTCT ATI GT AAAT A 
G AAATG AT AT AC T ATTTTC CC CT AG AC TAC AACAAAACTTTGC TATTTGCT AT GATGTTT T ATATCGAAAT AAAT 
TCTTTAGTAAATGATC 
[SEQ ID NO:2l] Exon 11 

G AATTT CAAAGC TG T AAAT T AATTT CT CAG TAG AAC TGTT ACACCAGTGTT AT AAAATTAAT CCCT ATC AATTGA 
GGAATT ATTTTTTC CATTC TGTTTT TC AATGTGTTC GT AAAAT ATT ACATTTTGAT ACTGTT TGATTTAG- 
ACCACA. . .CAGTGA[1 -GTAA 

GT ACATTTTTCTCAGTAAT T ATTTC AT TAAAC CC AG TC AT CGGGCTG AACC TC GCTTTGAAGGAGGCTGCT AG AC 
C ATTTT AT AAGATTCTATC AT TT CTGG AAAAAGC AAOT ATT AT ACAC AATATT ACT AAAT AT AAGG ATGCACTTT 
AAACAAAATAAGAGTTGG 
[SSQ ID NO: 2 2] Exon 12 

GTCTTAGTTT AATG TG AAT CAGC TC AT TGT AGTT GC ATCC ACTGGCC CAAATCT AT CAAT C T GT CGGTC TTTCTT 
TCTTTC TTTGTTTC TTTCT n TTTTTT iirn T AAC AG AG AT A GC TTT ATG TAT AAAT AGC C ATT AGTG TGGAAG 
GTATCACATGAGGTTGTGC TTCCCATTCTT AG - C TCATC ATCATC-GTAATCTGAATTTGCATTA 
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-continued 

TTTACTCTTCAGGATAAAGGGCTGAAGAAAGTTTAC 
TCTTTTTTAAAAATGTAGTTTCTTATGGACAGTCTTTAGGAAAAA^ 
ACAGAATTCACAGAAACC 
(SEQ ID NO: 23] Exon 13 

GATrTTGAAAAAAGTGAAGTGATAGCTACAGCTGAAATTCTGTCTTACCTATC 
CTACAC CC ATGTTT AACGAATTT AACCTTGGJLAGTG AAAG AAGTTCTGC 

TTACAGCATGTTTTGGGATTTTTTTTTTCTCCTAG- GCATGG T ACT ATTCAT - GT AAGT AT ATC TC 

TGTGATAACTTTGAATATGGTCATATTAAGAATACCTTCCTTTAG^ 

C AGCAC TTTGGG AGGCC AAAGTGGGTGGTC AC CT G AGGTC AGG AG TTCG AG AC CAGCCTGGC CAAC ATGGTGAAA 
C CCTAAAAAT AC AT ATAC 
[SBQ ID WO-.24] Exon 14 

GATCCAAACTCTGCATTTAAATACCAAGGCAGGTTTTAAAG 

AGGAATTAGAOVGACCTTTAGGGATCTGACATTCTATATTTTTGTATTATGTTTTAAT 

T ATTAATT CTTATGTTTGTTCCCCTCC AG- TT AACT ATGAAGGCA"raGC CCGAAATCTT^TCT -GTGAGT A 

TCTTTATTTTCCATTATCTAGTTATTTTTACTTTTGTATAATATATATTGA 

TGGGATTGAAGGATTAGAATATT1TEAGTAATCTGGGCCAACATGGAAATGCTGTGTAGTTTAAAGATC 
[SBQ ID KO:25j Exon 15 

CTGATGAAATGTTTGTGAAAAAAAATTTCATATGAAGTTAGAAAGCAATTTCAAGAAAAGTTGACA 

GAT ATT AGGG AAAT ATC TT TC C C T AAT AAAT ATC TT TC C C T AAAAAAGTTG AC ACT TTTr T AGAT ATT AGGGAAA 

TAATAGTTTTTCTTTGCT 

GTTTGCAATTTCAG--TGCCGGC CCATT - GT AAGTTCTGACAGTCTCCCAG 



GT AT AAGT ATGATT CTCTTCACTCT AACCC AGGATTTC TAATGTCGGCCTATGGATGTTTGAGTTAGAT AATTCT 
TTGTTG TGGAGAGC TGTC 
[SBQ ID NO:26j Exon 16 

AAAAGATAGAGGTGACTTCTTAATGCTTTTCAAAGCCAGGTGG 
AGTT AC AT ACTT TTTAATC AATGAAAATAATGTT ATG 

ATTATC AATT AT GTTTTATGAAAGG AC TTT AC ATTTTT AATTC ATAT ATGTCAACATT AG - G AAT GCAA- 

TCTAAAGAAAAAATGATATGCAAAGTTTTAGACTTGAAAACATACTGTGATTATATGTCT 
GGAACATACTT^CATAAAGCTATTTTTCTTTGAACATTAAAGAATTTTGTTA 
ACTAAAAAGTCAAAAAAC 
[SBQ ID NO: 27] Exon 17 

AAAACT AAGAGACC TATCC T AGATGTC CTT AG ATTATGTGTGT GATAGGGTTAAAACT AT ATTTCC CAC AAAGTC 
CACTGAGCGTGGTAGTTTTCCTCTTATCTTATCATAACGAGTTTGTATATGTACAATC 

GGGACC AACTTGTAGACAGCTGAAATGCACTG AT AAAC TTCCTTTTC TGGCCATCT AG-CCCCT GTC 

TG-GTAAGTGTGAACAGGTGCCTTTTTTCCCTTCTGAAAATAGACCrGAAATAGGA 
TTATCAAAAG<^GGTCACATTGTAGGCAACTTTGTCGAGATGATGGTGAGG 
TGACTCTCAGACTCACTGAAGAAATGTGGGGAACATG 
[SEQ ID HO: 26] Bxon 18 

C ATATC AGTATTTC T ATT AAAAAT AAC CT AGTC TTAAAT AC TC TAAAAC CC AAGAGAGTTTT AT ACTTTTATTTT 
AGTTAAAGAGTAAATGACTCATGTATTTGGTTTTAAAAAAGTAAAGA 

TTTG AAAC ATCT AAGTAAC TC TACC ATCTTGAAATT ATGC AG- ATTT A CTTCG-GTAAGTATCGTCAA 

GAAGTTTGGTCCAGTATGTATGGTTTGATAGCACCCTCTGCATAGCATGTGCTGT^ 



29 
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-continued 

TTAgAATTTAGGAGTGGGGGTAGCTAAACATATGTTTTAA'nCTAGGGGGCGCATGTAAATCTrTTGTGATATAT 
[SEQ ID BO: 2 9] Kxon 19 

GTGAAAGAGCAACACTC TTGC CTTGAAAGAGAAAAAAAAATCC ACTAAT ACAAGACTATC ATAAATGATCTTTGT 
TTTATGTTCKAATAATCAATCTATAGCGGTCTATGTTACAAAATTTAAAACATGTC 

TTTTAT AACCTTTTTTCAG - ATTTTGCC GAAO-GTAAGGCATGCTACACACTCAAGCTCGGAATGTG 

AAGCAGGC ATTTTCTCATC AGTGTG AAATGCAGAGAAC TGGCTTGGGGGTATT ATTTGAG AATAACCAATAAAAT 

AAAGGGAGTTCTGGAGGACCACCTGATGAAACATAGAGGTTTCTTTGCT 

I SHQ ZD BO: 3 0] Bxon 20 

GTCTTC TT AATTGTTTATGCTTGTACC CTTTGTAATCAGTTTTTTTAAT AGTT AAAAGTAAATCTTCAATATAAT 
TAAGTAGAGGAAAGGATTAGATGAGTGTATCACACTATATATTATCATATAATGCACACTAACTACATTTATT^ 

CATCCTGTGACCCAAG-A GAAGATTA GACAGAAAX-GCAAGTATTTGTCACC TC TTTATGTGTGGCC 

ATTTCAAATTAATGATTAAGCAGAACATTAAATGCATAGTTTCTCACTGOT 
C GCATT AG AGGAAC ACTGAAGAGGG AGTCAGAAAAAT 
[8EQ ID NO:31J Bxon 21 

TTTAATATTGTAAAGCATTTTTACACTTTAGTTAGAAAAAAAGATGAATATACTAGTAGfGAAAATAGGGAAGGAC 
ATGAGCTGACAGCTAGAGCTTCATAATTTTATGATGTAGTTCACCTTTAAATATTAATA^ 

TGTGCCTGATATCTGAGAGTTCTTCTCATTTTCGTTCTTCAG— CACA CCACCAC-GTAAGTTTTTTCC 

TCTCCT GACCTT CC CTTTTCTCCTTTTTGTTTTCTTTCTTGTTT ATAAATC CT ACC AT AC AT TATAGGGTAATAT 

ATATATTACCTATTATATATATATAGCTATATATATATACCTTTGTTTATTTATTGTGA 

[SEQ ID NO: 3 2] Bxon 22 

CTCATCTTGAAAAGACTTCTTAAATATTTTATTTTTGTAAAGGA^ 
CCTGTACTTGGGAAAGTTTTAC^GGTTTAAGATGGTACTCAGCTAATTO 

AAGT AT AATTTC CT ATGTT ATTTGTGAAGAAT GAAAAAGTTGTCCTC TTTTCTCTTTGTAG- AACTA TT 

C AAG - GTAAATAATGTTAACTCT ATATTTGAT AATTTT AATGAATTT GT GC AC AT 
ATAGGCATAATTCATATGTATAGGACTTATGGTCTAAATTAAATGAATTAATACCAAATACATTCT^ 
AACTTTGAGAATACTAGTACACAAAAATTCTAC 
[SEQ ID NO:33j Exon 23 

CTGGGTGAT ATAGCACGACTC TGTCTC TAAAC AAAAAACAAAACAAAACGAAGACTGAAGCC AAACTTGACTTTA 
TCTTTATTTACTATAAATGCTAATTTTGAATCATGGTGTTAATTTATTTCACACGTCAACATGGTC 

TTTGAAACTACACTGGCTTCTATCTTGTTTCAG-TTATA GAGGCA-GTAAGAACATATTTCATTACTC 

TTAAAAATAGGAATTACCATCCAGTAGAAATGGGATTACCATCCAGTTGAGTCAAGAGAACCTTTTTTATCCAGT 
GTCGTATGTTTATGTGTATGACACTTCTGACTACACAGGAACK^CTCTTGAAATATCTGATTAATTTTGATC 
GCTCAATGTTCAGTAAAA 
[SEQ ID HO: 3 4] Bxon 24 

GTrCTTATATTTAATTATTGGTTGGAATTTGATTTTTATATGTATTAAAAGCATGCTCTACTGAAATATTCATCA 

AAAGGAAGATAGTTATTTCTTTCTTAAAATGAATATTGGCATGTTTTACAG-AAAAA TCTGTG- 

GTAAGTAGCTTTTGTATATTTAC 

TTTGCATGTTGAAAATCTAGACATATGCATATTTGTTTATGTC 
ACTGAGTACACATGGACTTCGAAATTATAGGATGCTTTTAAATTTGATCTTT^ 
G ACT AC CC TGTC TGCTTT 
[SEQ ID BO: 35] Bxon 25 

AATTAAACAAACATGCATGGTATGTATTAGAAGGAAAGCTACTCAAGAGG^ 
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GGCAC G TT C C AC TT CAG AGCT G AAA TC TCG TAAATG ATT AAAC TGGGG AGATGGAGCACT T AT AGAAGT GAAC TG 

AGTGTTCTCTTGGT AAC TTTTCTTTTATATTTCC TATTCTCCT AG -C ATGC . . . . ATTAA- AAAAG AAAAA 

T ATT CC T ATC CT GC TCACT GG T AAT T AACAT AGG TT TAAAATGGC TT CAAATGTGGCC CT AT AG AC GGT T AAAAT 

ATGTTTCTTTCTGTGCAC 



It is to be understood that this invention is not limited to 
the particular methodology, protocols, formulations and 
reagents described, as such may, of course, vary. It is also to 
be understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is 
not intended to limit the scope of the present invention 
which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended 
claims, the singular forms "a", "and", and "the" include 
plural referents unless the context clearly dictates otherwise. 
Thus, for example, reference to "a complex" includes a 
plurality of such complexes and reference to "the formula- 
tion" includes reference to one or more formulations and 
equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as commonly 



understood to one of ordinary skill in the art to which this 
invention belongs. Although any methods, devices and 
materials similar or equivalent to those described herein can 
15 be used in the practice or testing of the invention, the 
preferred methods, devices and materials are now described. 

All publications mentioned herein are incorporated herein 
20 by reference for the purpose of describing and disclosing, for 
example, the cell lines, constructs, and methodologies that 
are described in the publications which might be used in 
connection with the presently described invention. The 
publications discussed above and throughout the text are 
25 provided solely for their disclosure prior to the filing date of 
the present application. Nothing herein is to be construed as 
an admission that the inventors are not entitled to antedate 
such disclosure by virtue of prior invention. 



SEQUENCE LISTING 



<160> NUMBER OF SEQ ID NOS: 35 

<210> SEQ ID NO 1 

<211> LENGTH: 34B6 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (164) .. .(2785) 

<400> SEQUENCE: 1 

ggccacgatg gagcgcgacg gctgcgcggg gggcgggagc cgcggoggcg agggcgggcg 60 

cgctccccgg gagggcccgg cggggaacgg ccgcgatcgg ggccgcagcc acgctgccga 120 

ggcgcccggg gacccgcagg cggccgcgtc cttgctggcc cct atg gac gtg ggg 175 

Met Asp Val Gly 
1 

gag gag ccg ctg gag aag gcg geg cgc gcc cgc act gcc aag gac ccc 223 
Glu Glu Pro Leu Glu Lys Ala Ala Arg Ala Arg Thr Ala Lye Asp Pro 
5 10 15 20 

aac acc tat aaa gta etc teg ctg gta ttg tea gta tgt gtg tta aca 271 
Ann Thr Tyr Lys Val Leu Ser Leu Val Leu Ser Val Cys Val Leu Thr 
25 30 35 

aca ata ctt ggt tgt ata ttt ggg ttg aaa cca age tgt gcc aaa gaa 319 
Thr lie Leu Gly Cya He Phe Gly Leu Lya Pro Ser Cys Ala Lys Glu 
40 45 50 

gtt aaa agt tgc aaa ggt cgc tgt ttc gag aga aca ttt ggg aac tgt 367 
Val Lye Ser Cys Lya Gly Arg Cys Phe Glu Arg Thr Phe Gly Aon Cys 
55 60 65 

cgc tgt gat get gcc tgt gtt gag ctt gga aac tgc tgt tta gat tac 415 
Arg Cys Asp Ala Ala Cys Val Glu Leu Gly Asn Cya Cys Leu Asp Tyr 
70 75 80 
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cag gag acg tgc ata gaa cca gaa cat ata tgg act tgc aac aaa ttc 
Gin Glu Thr Cys lie Glu Pro Glu His lie Trp Thr Cys Asn Lys Phe 
85 90 9S 100 

agg tgt ggt gag aaa agg ttg acc aga age etc tgt gec tgt tea gat 
Arg Cys Gly Glu Lye Arg Leu Thr Arg Ser Leu Cys Ala Cys Ser Asp 
105 110 115 

gac tgc aag gac aag ggc gac tgc tgc ate aac tac agt tct gtg tgt 
Asp Cys Lys Asp Lys Gly Asp Cys Cys lie Asn Tyr Ser Ser Vol Cys 
120 125 130 

caa ggt gag aaa agt tgg gta gaa gaa cca tgt gag age att aat gag 
Gin Gly Glu Lys Ser Trp Val Glu Glu Pro Cys Glu Ser lie Asn Glu 
135 140 145 

cca cag tgc cca gca ggg ttt gaa acg cct cct acc etc tta ttt tct 
Pro Gin Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe Ser 
150 155 160 

ttg gat gga ttc agg gca gaa tat tta cac act tgg ggt gga ctt ctt 
Leu Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu 
165 170 175 180 

cct gtt att age aaa eta aaa aaa tgt gga aca tat act aaa aac atg 
Pro Val lie Ser Lys Leu Lys Lys Cys Gly Thr Tyr Thr Lys Asn Met 
1B5 190 195 

aga ccg gta tat cca aca aaa act ttc ccc aat cac tac age att gtc 
Arg Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn His Tyr Ser He Val 
200 205 210 

acc gga ttg tat cca gaa tct cat ggc ata ate gac aat aaa atg tat 
Thr Gly Leu Tyr Pro Glu Ser His Gly He lie Asp Asn Lys Met Tyr 
215 220 225 

gat ccc aaa atg aat get tec ttt tea ctt aaa agt aaa gag aaa ttt 
Asp Pro Lys Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe 
230 235 240 

aat cct gag tgg tac aaa gga gaa cca att tgg gtc aca get aag tat 
Asn Pro Glu Trp Tyr Lys Gly Glu Pro He Trp Val Thr Ala Lys Tyr 
245 250 255 260 

caa ggc etc aag tct ggc aca ttt ttc tgg cca gga tea gat gtg gaa 
Gin Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly Ser Asp Val Glu 
265 270 " 275 

att aac gga att ttc cca gac ate tat aaa atg tat aat ggt tea gta 
He Asn Gly He Phe Pro Asp He Tyr Lys Met Tyr Asn Gly Ser Val 
230 285 290 

cca ttt gaa gaa agg att tta get gtt ctt cag tgg eta cag ctt cct 
Pro Phe Glu Glu Arg He Leu Ala Val Leu Gin Trp Leu Gin Leu Pro 
295 300 305 

aaa gat gaa aga cca cac ttt tac act ctg tat tta gaa gaa cca gat 
Lys Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr Leu Glu Glu Pro Asp 
310 315 320 

tct tea ggt cat tea tat gga cca gtc age agt gaa gtc ate aaa gec 
Ser Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu Val He Lys Ala 
325 330 335 340 

ttg cag agg gtt gat ggt atg gtt ggt atg ctg atg gat ggt ctg aaa 
Leu Gin Arg Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu Lys 
345 350 355 

gag ctg aac ttg cac aga tgc ctg aac etc ate ctt att tea gat cat 
Glu Leu Asn Leu His Arg Cys Leu Asn Leu He Leu He Ser Asp His 
360 365 370 

ggc atg gaa caa ggc agt tgt aag aaa tac ata tat ctg aat aaa tat 
Gly Met Glu Gin Gly Ser Cys Lys Lys Tyr He Tyr Leu Asn Lys Tyr 
375 380 385 

ttg ggg gat gtt aaa aat att aaa gtt ate tat gga cct gca get cga 
Leu Gly Asp Val Lys Asn He Lys Val He Tyr Gly Pro Ala Ala Arg 
390 395 400 
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ttg aga ccc tct gat gtc cca gat aaa tac tat tea ttt aac tat gaa 
Leu Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser Phe Aon Tyr Glu 

405 410 415 420 

ggc ott gec cga aat ctt tct tgc egg gaa cca aac cag cac ttc aaa 

Gly lie Ala Arg Aen Leu Ser Cye Arg Glu Pro Asn Gin His Phe Lye 

425 430 435 

cct tac ctg aaa cat ttc tta cct aag cgt ttg cac ttt get aag agt 

Pro Tyr Leu Lys His Phe Leu Pro Lys Arg Leu His Phe Ala Lys Ser 

440 445 450 

gat aga att gag ccc ttg aca ttc tat ttg gac cct cag tgg caa ctt 

Asp Arg lie Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gin Trp Gin Leu 

455 460 465 

gca ttg aat ccc tea gaa agg aaa tat tgt gga agt gga ttt cat ggc 

Ala Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly 

470 475 480 

tct gac aat gta ttt tea aat atg caa gee etc ttt gtt ggc tat gga 

Ser Asp Asn Val Phe Ser Asn Met Gin Ala Leu Phe Val Gly Tyr Gly 

485 490 495 500 

cct gga ttc aag cat ggc att gag get gac acc ttt gaa aac att gaa 

Pro Gly Phe Lys His Gly He Glu Ala Asp Thr Phe Glu Asn He Glu 

505 510 515 

gtc tat aac tta atg tgt gat tta ctg aat ttg aca ccg get cct aat 

Val Tyr Asn Leu Met Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn 

520 525 530 

aac gga act cat gga agt ctt aac cac ctt eta aag aat cct gtt tat 

Asn Gly Thr His Gly Ser Leu Asn His Lou Leu Lys Asn Pro Val Tyr 

535 540 545 

acg cca aag cat ccc aaa gaa gtg cac ccc ctg gta cag tgc ccc ttc 

Thr Pro Lys His Pro Lys Glu Val His Pro Leu Val Gin Cys Pro Phe 

550 555 560 

aca aga aac ccc aga gat aac ctt ggc tgc tea tgt aac cct teg att 

Thr Arg Asn Pro Arg Asp Asn Leu Gly Cys Ser Cys Asn Pro Ser He 

565 570 575 580 

ttg ccg att gag gat ttt caa aca cag ttc aat ctg act gtg gca gaa 

Leu Pro He Glu Asp Phe Gin Thr Gin Phe Asn Leu Thr Val Ala Glu 

585 590 595 

gag aag att att aag cat gaa act tta ccc tat gga aga cct aga gtt 

Glu Lys He He Lys His Glu Thr Leu Pro Tyr Gly Arg Pro Arg Val 

600 605 610 

etc cag aag gaa aac acc ate tgt ctt ctt tec cag eac cag ttt atg 

Leu Gin Lys Glu Asn Thr He Cys Leu Leu Ser Gin His Gin Phe Met 

615 620 625 

agt gga tac age caa gac ate tta atg ccc ctt tgg aca tec tat acc 

Ser Gly Tyr Ser Gin Asp He Leu Met Pro Leu Trp Thr Ser Tyr Thr 

630 635 640 

gtg gac aga aat gac agt ttc tct acg gaa gac ttc tec aac tgt ctg 

Val Asp Arg Asn Asp Ser Phe Ser Thr Glu Asp Phe Ser Asn Cys Leu 

645 650 655 660 

tac cag gac ttt aga att cct ctt agt cct gtc cat aaa tgt tea ttt 

Tyr Gin Asp Phe Arg He Pro Leu Ser Pro Val His Lys Cys Ser Phe 

665 670 675 

tat aaa aat aac acc aaa gtg agt tac ggg ttc etc tec cca cca caa 

Tyr Lys Asn Asn Thr Lys Val Ser Tyr Gly Phe Leu Ser Pro Pro Gin 

680 685 690 

eta aat aaa aat tea agt gga ata tat tct gaa get ttg ctt act aca 

Leu Asn Lys Asn Ser Ser Gly He Tyr Ser Glu Ala Leu Leu Thr Thr 

695 700 705 

aat ata gtg cca atg tac cag agt ttt caa gtt ata tgg cgc tac ttt 

Asn He Val Pro Met Tyr Gin Ser Phe Gin Val He Trp Arg Tyr Phe 
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710 715 720 

cat gac acc eta ctg cga oag tat get gaa gaa aga aat ggt gtc aat 2 383 

His Asp Thr Leu Leu Arg Lys Tyr Ala Glu Glu Arg Asn Gly Val Asn 
725 730 735 740 

gtc gtc agt ggt cct gtg ttt gac ttt gat tat gat gga cgt tgt gat 2 431 

Val Val Ser Gly Pro Val Phe Asp Phe Asp Tyr Asp Gly Arg Cys Asp 
745 750 755 

tec tta gag aat ctg agg caa aaa aga aga gtc ate cgt aac caa gaa 2 47 9 
Ser Leu Glu Asn Leu Arg Gin Lys Arg Arg Val lie Arg Asn Gin Glu 
760 765 770 

att ttg att cca act cac ttc ttt att gtg eta aca age tgt aaa gat 2527 
He Leu lie Pro Thr His Phe Phe He Val Leu Thr Ser Cys Lys Asp 
775 780 785 

aca tct cag acg cct ttg cac tgt gaa aac eta gac acc tta get ttc 2575 
Thr Ser Gin Thr Pro Leu His Cys Glu Asn Leu Asp Thr Leu Ala Phe 
790 795 800 

att ttg cct cac agg act gat aac age gag age tgt gtg cat ggg aag 2623 
He Leu Pro His Arg Thr Asp Asn Ser Glu Ser Cye Val His Gly Lya 
B05 810 815 820 

cat gac tec tea tgg gtt gaa gaa ttg tta atg tta cac aga gca egg 2671 
His Asp Ser Ser Trp Val Glu Glu Leu Leu Met Leu His Arg Ala Arg 
825 830 835 

ate aca gat gtt gag cac ate act gga etc age ttc tat caa caa aga 2719 
He Thr Asp Val Glu His He Thr Gly Leu Ser Phe Tyr Gin Gin Arg 
840 845 8S0 

aaa gag cca gtt tea gac att tta aag ttg aaa aca cat ttg cca acc 2767 
Lys Glu Pro Val Ser Asp He Leu Lys Leu Lys Thr His Leu Pro Thr 
855 860 865 

ttt age caa gaa gac tga tatgtttttt atccccaaac accatgaatc 2B15 
Phe Ser Gin Glu Asp 
870 



tttttgagag 


aaccttatat 


tttatatagt 


cctctagcta cactattgea 


ttgttcagaa 


2875 


actgtcgacc 


agagttagaa 


cggagccctc 


ggtgatgcgg acatctcagg 


gaaacttgcg 


2935 


tactcagcac 


agcagtggag 


agtgttcctg 


ttgaatcttg cacatatttg 


aatgtgtaag 


2995 


cattgtatac 


attgatcaag 


ttcgggggaa 


taaagacaga ccacacctaa 


aactgecttt 


3055 


ctgcttctct 


taaaggagaa 


gtagctgtga 


acattgtctg gataccagat 


atttgaatct 


3115 


ttcttactat 


tggtaataaa 


ccttgatgge 


attgggcaaa cagtagactt 


atagtagggt 


3175 


tggggtagee 


catgttatgt 


goctatcttt 


atgagaattt taaagtggtt 


ctggatatct 


3235 


tttaacttgg 


agt ttc at tt 


cttttcattg 


taatcaaaaa aaaaattaac 


agaagecaaa 


3295 


atacttctga 


gaccttgttt 


caatctttgc 


tgtatatccc ctcaaaatcc 


aagttattaa 


3355 


tcttatgtgt 


tttcttttta 


attttttgat 


tggatttctt tagatttaat 


ggttcaaatg 


3415 


agttcaactt 


tgagggacga 


tctttgaata 


tacttaccta ttataaaatc 


ttaetttgta 


3475 


tttgtattta 


a 








3486 



<210> SEQ ID NO 2 

<2U> LENGTH: 873 

<212> TYPE: PRT 

<213> ORGANISM: H. sapiens 

<4 00> SEQUENCE: 2 

Met Asp Val Gly Glu Glu Pro Leu Glu Lys Ala Ala Arg Ala Arg Thr 
15 10 15 



Ala Lys Asp Pro Asn Thr Tyr Lys Val Leu Ser Leu Val Leu Ser Val 
20 2S 30 
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Cys Val Leu Thr Thr lie Leu Gly Cys lie Phe Gly Leu Lye Pro Ser 

35 40 45 

Cys Ala Lys Glu Val Lys Ser Cys Lys Gly Arg Cys Phe Glu Arg Thr 
50 55 60 

Phe Gly Asn CyB Arg Cys Asp Ala Ala Cys Val Glu Leu Gly Asn Cye 
65 70 75 80 

Cye Leu Asp Tyr Gin Glu Thr Cys He Glu Pro Glu His He Trp Thr 



Cys Asn Lys Phe Arg Cys Gly Glu Lys Arg Leu Thr Arg Ser Leu Cys 
100 105 110 

Ala Cys Ser Asp Asp Cys Lys Asp Lys Gly Asp Cys Cys lie Asn Tyr 
115 120 125 

Ser Ser Val Cye Gin Gly Glu Lys Ser Trp Val Glu Glu Pro Cys Glu 
130 135 140 

Ser He Asn Glu Pro Gin Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr 
145 150 155 160 

Leu Leu Phe Ser Leu Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp 
165 170 175 

Gly Gly Leu Leu Pro Val He Ser Lys Leu Lys Lys Cys Gly Thr Tyr 
180 185 190 

Thr Lys Asn Met Arg Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn His 
195 200 205 

Tyr Ser He Val Thr Gly Leu Tyr Pro Glu Ser His Gly He He Asp 
210 215 220 

Asn Lys Met Tyr Asp Pro Lys Met Asn Ala Ser Phe Ser Leu Lys Ser 
225 230 235 240 

Lys Glu Lys Phe Asn Pro Glu Trp Tyr Lys Gly Glu Pro He Trp Val 
245 250 255 

Thr Ala Lys Tyr Gin Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly 
260 265 270 

Ser Asp Val Glu He ABn Gly He Phe Pro Asp He Tyr Lys Met Tyr 
275 280 285 

Asn Gly Ser Val Pro Phe Glu Glu Arg He Leu Ala Val Leu Gin Trp 
290 295 300 

Leu Gin Leu Pro Lys Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr Leu 
305 310 315 320 

Glu Glu Pro Asp Ser Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu 
325 330 335 

Val He Lys Ala Leu Gin Arg Val Asp Gly Met Val Gly Met Leu Met 
340 345 350 

Asp Gly Leu Lys Glu Leu Asn Leu His Arg Cys Leu Asn Leu He Leu 
355 360 365 

He Ser Asp His Gly Met Glu Gin Gly Ser Cys Lys Lys Tyr He Tyr 
370 375 380 

Leu Asn Lys Tyr Leu Gly Asp Val Lys Asn He Lys Val He Tyr Gly 
385 390 395 400 

Pro Ala Ala Arg Leu Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser 
405 410 415 

Phe Asn Tyr Glu Gly He Ala Arg Asn Leu Ser Cys Arg Glu Pro Asn 
420 425 430 

Gin His Phe Lys Pro Tyr Leu Lys His Phe Leu Pro Lys Arg Leu Hia 
435 440 445 
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Phe Ala Lye Ser Asp Arg lie Glu Pro Leu Thr Phe Tyr Leu Asp Pro 
450 455 460 

Gin Trp Gin Leu Ala Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser 
465 4 70 475 4 90 

Gly Phe His Gly Ser Asp Asn Val Phe Ser Asn Met Gin Ala Leu Phe 
485 490 495 

Val Gly Tyr Gly Pro Gly Phe Lye Hie Gly lie Glu Ala Aep Thr Phe 
500 505 510 

Glu Asn lie Glu Val Tyr Asn Leu Met Cys Asp Leu Leu Asn Leu Thr 
515 520 525 

Pro Ala Pro Asn Asn Gly Thr His Gly Ser Leu Asn His Leu Leu Lys 
530 535 540 

Asn Pro Val Tyr Thr Pro Lys Hie Pro Lye Glu Val His Pro Leu Val 
545 550 555 560 

Gin Cys Pro Phe Thr Arg Asn Pro Arg Aop Asn Leu Gly Cys Ser Cys 
565 570 575 

Asn Pro Ser lie Leu Pro lie Glu Asp Phe Gin Thr Gin Phe Asn Leu 
580 585 590 

Thr Val Ala Glu Glu Lys lie He Lys His Glu Thr Leu Pro Tyr Gly 
595 600 605 

Arg Pro Arg Val Leu Gin Lys Glu Asn Thr He Cys Leu Leu Ser Gin 
610 615 620 

His Gin Phe Met Ser Gly Tyr Ser Gin Asp He Leu Met Pro Leu Trp 
625 630 635 640 

Thr Ser Tyr Thr Val Asp Arg Asn Asp Ser Phe Ser Thr Glu Asp Phe 
645 650 655 

Ser Asn Cys Leu Tyr Gin Aep Phe Arg lie Pro Leu Ser Pro Val His 
660 665 670 

Lys Cys Ser Phe Tyr Lys Asn Asn Thr Lys Val Ser Tyr Gly Phe Leu 
675 680 685 

Ser Pro Pro Gin Leu Asn Lys Asn Ser Ser Gly He Tyr Ser Glu Ala 
690 695 700 

Leu Leu Thr Thr Asn He Val Pro Met Tyr Gin Ser Phe Gin Val He 
705 710 715 720 

Trp Arg Tyr Phe His Asp Thr Leu Leu Arg Lys Tyr Ala Glu Glu Arg 
725 730 735 

Asn Gly Val Asn Val Val Ser Gly Pro Val Phe Asp Phe Asp Tyr Asp 
740 745 750 

Gly Arg Cys Asp Ser Leu Glu Asn Leu Arg Gin Lys Arg Arg Val He 
755 760 765 

Arg Asn Gin Glu He Leu He Pro Thr His Phe Phe He Val Leu Thr 
770 775 780 

Ser Cys Lys Asp Thr Ser Gin Thr Pro Leu His Cys Glu Asn Leu Asp 
785 790 795 800 

Thr Leu Ala Phe He Leu Pro His Arg Thr Asp Asn Ser Glu Ser Cys 
805 B10 815 

Val His Gly Lys His Asp Ser Ser Trp Val Glu Glu Leu Leu Met Leu 
820 825 830 

His Arg Ala Arg He Thr Asp Val Glu His He Thr Gly Leu Ser Phe 
835 840 845 

Tyr Gin Gin Arg Lys Glu Pro Val Ser Asp He Leu Lys Leu Lys Thr 
850 855 860 

His Leu Pro Thr Phe Ser Gin Glu Asp 
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<210> SEQ ID NO 3 

<211> LENGTH : 3486 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: ( 164 )... (2785) 

<400> SEQUENCE: 3 

ggccacgatg gagcgcgacg gctgcgcggg gggcgggagc cgcggcggcg agggcgggcg 60 

cgctccccgg gagggcccgg cggggaacgg ccgcgatcgg ggccgcagcc acgctgccga 120 

ggcgcccggg gacccgcagg cggccgcgtc cttgctggcc cct atg gac gtg ggg 175 

Met Asp Val Gly 



gag gag ccg ctg gag aag gcg gcg cgc gcc cgc act gee aag gac ccc 

Glu Glu Pro Leu Glu Lys Ala Ala Arg Ala Arg Thr Ala Lys Asp Pro 

S 10 15 20 

aac acc tat aaa gta etc teg ctg gta ttg tea gta tgt gtg tta aca 

Asn Thr Tyr Lys Val Leu Ser Leu Val Leu Ser Val Cys Val Leu Thr 

25 30 " 35 

aca ata ctt ggt tgt ata ttt ggg ttg aaa cca age tgt gcc aaa gaa 

Thr lie Leu Gly Cys lie Phe Gly Leu Lys Pro Ser Cys Ala Lys Glu 

40 45 50 

gtt aaa agt tgc aaa ggt cgc tgt ttc gag aga aca ttt ggg aac tgt 

Val Lys Ser Cys Lys Gly Arg Cys Phe Glu Arg Thr Phe Gly Asn Cys 

55 60 65 

cgc tgt gat get gcc tgt gtt gag ctt gga aac tgc tgt tta gat tac 

Arg Cys Asp Ala Ala Cys Val Glu Leu Gly Asn Cys Cys Leu Asp Tyr 



cag gag acg tgc ata gaa cca gaa cat ata tgg act tgc aac aaa ttc 

Gin Glu Thr Cys He Glu Pro Glu His He Trp Thr Cys Asn Lys Phe 

85 90 95 100 

agg tgt ggt gag aaa agg ttg acc aga age etc tgt gcc tgt tea gat 

Arg Cys Gly Glu Lys Arg Leu Thr Arg Ser Leu Cys Ala Cys Ser Asp 

105 110 115 

gac tgc aag gac cag ggc gac tgc tgc ate aac tac agt tct gtg tgt 

Asp Cys Lys Asp Gin Gly Asp Cys Cys He Asn Tyr Ser Ser Val Cys 
120 125 130 

caa ggt gag aaa agt tgg gta gaa gaa cca tgt gag age att aat gag 

Gin Gly Glu Lys Ser Trp Val Glu Glu Pro Cys Glu Ser lie Asn Glu 

135 140 145 

cca cag tgc cca gca ggg ttt gaa acg cct cct acc etc tta ttt tct 

Pro Gin Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr Leu Leu Phe Ser 
150 155 160 

ttg gat gga ttc agg gca gaa tat tta cac act tgg ggt gga ctt ctt 

Leu Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp Gly Gly Leu Leu 

165 170 175 180 

cct gtt att age aaa eta aaa aaa tgt gga aca tat act aaa aac atg 

Pro Val He Ser Lys Leu Lys Lys CyB Gly Thr Tyr Thr Lys Asn Met 

185 190 195 

aga ccg gta tat cca aca aaa act ttc ccc aat cac tac age att gtc 

Arg Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn His Tyr Ser He Val 
200 205 210 

acc gga ttg tat cca gaa tct cat ggc ata ate gac aat aaa atg tat 

Thr Gly Leu Tyr Pro Glu Ser His Gly He He Asp Asn Lys Met Tyr 

215 220 225 



gat ccc aaa atg aat get tec ttt tea ctt aaa agt aaa gag aaa ttt 
Asp Pro Lys Met Asn Ala Ser Phe Ser Leu Lys Ser Lys Glu Lys Phe 



895 
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aat cct gag tgg tac aaa gga gaa cca att tgg gtc ace get aag tat 
Abu Pro Glu Trp Tyr Lys Gly Glu Pro lie Trp Val Thr Ala Lys Tyr 
245 250 255 260 

caa ggc etc aag tct ggc aca ttt ttc tgg cca gga tea gat gtg gaa 
Gin Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly Ser Asp Val Glu 
265 270 275 

att aac gga att ttc cca gac ate tat aaa atg tat aat ggt tea gta 
lie Asn Gly He Phe Pro Asp He Tyr Lys Met Tyr Asn Gly Ser Val 
280 2G5 290 

cca ttt gaa gaa agg att tta get gtt ctt cag tgg eta cag ctt cct 
Pro Phe Glu Glu Arg He Leu Ala Val Leu Gin Trp Leu Gin Leu Pro 
295 300 305 

aaa gat gaa aga cca cac ttt tac act ctg tat tta gaa gaa cca gat 
Lys Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr Leu Glu Glu Pro Asp 
310 315 320 

tct tea ggt cat tea tat gga cca gtc age agt gaa gtc ate aaa gec 
Ser Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu Val He Lys Ala 
325 330 335 340 

ttg cag agg gtt gat ggt atg gtt ggt atg ctg atg gat ggt ctg aaa 
Leu Gin Arg Val Asp Gly Met Val Gly Met Leu Met Asp Gly Leu Lys 
345 350 355 

gag ctg aac ttg cac aga tgc ctg aac etc ate ctt att tea gat cat 
Glu Leu Asn Leu His Arg Cys Leu Asn Leu He Leu He Ser Asp His 
360 365 370 

ggc atg gaa caa ggc agt tgt aag aaa tac ata tat ctg aat aaa tat 
Gly Met Glu Gin Gly Ser Cys Lys Ly s Tyr He Tyr Leu Asn Lys Tyr 
375 380 385 

ttg ggg gat gtt aaa aat att aaa gtt ate tat gga cct gca get cga 
Leu Gly Asp Val Lys ABn He Lys Val He Tyr Gly Pro Ala Ala Arg 
390 395 400 

ttg aga ccc tct gat gtc cca gat aaa tac tat tea ttt aac tat gaa 
Leu Arg Pro Ser Asp Val Pro Aap Lys Tyr Tyr Ser Phe Asn Tyr Glu 
405 410 415 420 

ggc att gee cga aat ctt tct tgc egg gaa cca aac cag cac ttc aaa 
Gly He Ala Arg Asn Leu Ser Cye Arg Glu Pro Asn Gin His Phe Lys 
425 430 435 

cct tac ctg aaa cat ttc tta cct aag cgt ttg cac ttt get aag agt 
Pro Tyr Leu Lys His Phe Leu Pro Lys Arg Leu His Phe Ala Lys Ser 
440 445 450 

gat aga att gag ccc ttg aca ttc tat ttg gac cct cag tgg caa ctt 
Asp Arg He Glu Pro Leu Thr Phe Tyr Leu Asp Pro Gin Trp Gin Leu 
455 460 465 

gca ttg aat ccc tea gaa agg aaa tat tgt gga agt gga ttt cat ggc 
Ala Leu Asn Pro Ser Glu Arg Lys Tyr Cys Gly Ser Gly Phe His Gly 
470 475 480 

tct gac aat gta ttt tea aat atg caa gee etc ttt gtt ggc tat gga 
Ser Asp Asn Val Phe Ser Asn Met Gin Ala Leu Phe Val Gly Tyr Gly 
485 490 495 500 

cct gga ttc aag cat ggc att gag get gac acc ttt gaa aac att gaa 
Pro Gly Phe Lys His Gly He Glu Ala Asp Thr Phe Glu Asn He Glu 
505 510 515 

gtc tat aac tta atg tgt gat tta ctg aat ttg aca ccg get cct aat 
Val Tyr Asn Leu Met Cys Asp Leu Leu Asn Leu Thr Pro Ala Pro Asn 

520 . 525 530 

aac gga act cat gga agt ctt aac cac ctt eta aag aat cct gtt tat 
Asn Gly Thr His Gly Ser Leu Asn His Leu Leu Lys Asn Pro Val Tyr 
535 540 545 

acg cca aag cat ccc aaa gaa gtg cac ccc ctg gta cag tgc ccc ttc 
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Thr Pro Lys His Pro Lys Glu Val His Pro Leu Val Gin Cys Pro Phe 
550 555 560 

oca aga aac ccc aga gat aac ctt ggc tgc tea tgt aac cct teg att 
Thr Arg Asn Pro Arg Asp Aen Lou Gly Cys Ser Cys Asn Pro Ser lie 
565 570 575 580 

ttg ccg att gag gat ttt caa aea cag ttc aat ctg act gtg gca gaa 
Leu Pro lie Glu Asp Phe Gin Thr Gin Phe Asn Leu Thr Val Ala Glu 
585 590 595 

gag aag att att aag cat gaa act tta ccc tat gga aga cct aga gtt 
Glu Lys lie lie Lys His Glu Thr Leu Pro Tyr Gly Arg Pro Arg Val 
600 605 610 

etc cag aag gaa aac acc ate tgt ctt ctt tec cag cac cag ttt atg 
Leu Gin Lys Glu Asn Thr lie Cys Leu Leu Ser Gin His Gin Phe Met 
615 620 625 

agt gga tac age caa gac ate tta atg ccc ctt tgg aca tec tat acc 
Ser Gly Tyr Ser Gin Asp lie Leu Met Pro Leu Trp Thr Ser Tyr Thr 
630 635 640 

gtg gac aga aat gac agt ttc tct acg gaa gac ttc tec aac tgt ctg 
Val Asp Arg Asn Asp Ser Phe Ser Thr Glu Asp Phe Ser Asn Cys Leu 
645 650 655 660 

tac cag gac ttt aga att cct ctt agt cct gtc cat aaa tgt tea ttt 
Tyr Gin Asp Phe Arg He Pro Leu Ser Pro Val His Lys Cys Ser Phe 
665 670 ' 675 

tat aaa aat aac acc aaa gtg agt tac ggg ttc etc tec cca cca caa 
Tyr Lys Asn Asn Thr Lys Val Ser Tyr Gly Phe Leu Ser Pro Pro Gin 
680 685 690 

eta aat aaa aat tea agt gga ata tat tct gaa get ttg ctt act aca 
Leu Asn Lys Asn Ser Ser Gly He Tyr Ser Glu Ala Leu Leu Thr Thr 
695 700 705 

aat ata gtg cca atg tac cag agt ttt caa gtt ata tgg cgc tac ttt 
Asn He Val Pro Met Tyr Gin Ser Phe Gin Val He Trp Arg Tyr Phe 
710 715 720 

cat gac acc eta ctg cga aag tat get gaa gaa aga aat ggt gtc aat 
His Asp Thr Leu Leu Arg Lys Tyr Ala Glu Glu Arg Asn Gly Val Asn 
725 730 735 740 

gtc gtc agt ggt cct gtg ttt gac ttt gat tat gat gga cgt tgt gat 
Val Val Ser Gly Pro Val Phe Asp Phe Asp Tyr Asp Gly Arg Cys Asp 
745 750 755 

tec tta gag aat ctg agg caa aaa aga aga gtc ate cgt aac caa gaa 
Ser Leu Glu Asn Leu Arg Gin Lys Arg Arg Val He Arg Asn Gin Glu 
760 765 770 

att ttg att cca act cac ttc ttt att gtg eta aca age tgt aaa gat 
He Leu He Pro Thr His Phe Phe He Val Leu Thr Ser Cys Lys Asp 
775 780 785 

aca tct cag acg cct ttg cac tgt gaa aac eta gac acc tta get ttc 
Thr Ser Gin Thr Pro Leu His Cys Glu Asn Leu Asp Thr Leu Ala Phe 
790 795 800 

att ttg cct cac agg act gat aac age gag age tgt gtg cat ggg aag 
He Leu Pro His Arg Thr Asp Asn Ser Glu Ser Cys Val His Gly Lys 
B05 810 815 820 

cat gac tec tea tgg gtt gaa gaa ttg tta atg tta cac aga gca egg 
His Asp Ser Ser Trp Val Glu Glu Leu Leu Met Leu His Arg Ala Arg 
825 830 835 

ate aca gat gtt gag cac ate act gga etc age ttc tat caa caa aga 
He Thr Asp Val Glu His He Thr Gly Leu Ser Phe Tyr Gin Gin Arg 
840 845 850 



aaa gag cca gtt tea gac att tta aag ttg aaa aca cat ttg cca acc 
Lys Glu Pro Val Ser Asp He Leu Lys Leu Lys Thr His Leu Pro Thr 
855 860 865 
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ttt age caa gaa gac tga tatgtttttt atccccaaac accatgaatc 
Phe Ser Gin Glu Asp 



870 














tttttgagag 


aaccttatat 


tttatatagt 


cctctagcta 


cactattgea 


ttgttcagaa 


2875 


actgtcgacc 


agagttagaa 


cggagccctc 


ggtgatgcgg 


acatctcagg 


gaaacttgcg 


2935 


tactcagcac 


agcagtggag 


agtgttcctg 


ttgaatcttg 


cacatatttg 


aatgtgtaag 


2995 


cattgtatac 


attgatcaag 


ttcgggggaa 


taaagacaga 


ccacacctaa 


aactgecttt 


3055 


ctgcttctct 


taaaggagaa 


gtagctgtga 


acattgtctg 


gataccagat 


atttgaatct 


3115 


ttcttactat 


tggtaataaa 


ccttgatggc 


attgggcaaa 


cagtagactt 


atagtagggt 


3175 


tggggtagee 


catgttatgt 


gactatcttt 


atgagaattt 


taaagtggtt 


ctggatatct 


3235 


tttaacttgg 


agtttcattt 


cttttcattg 


taatcaaaaa 


aaaaattaac 


agaagecaaa 


3295 


atacttctga 


gaccttgttt 


caatctttgc 


tgtatatccc 


ctcaaaatcc 


aagttattaa 


3355 


tcttatgtgt 


tttcttttta 


attttttgat 


tggatttctt 


tagatttaat 


ggttcaaatg 


3415 


egttcaactt 


tgagggacga 


tctttgaata 


tacttaccta 


ttataaaatc 


ttactttgta 


3475 


tttgtattta 


a 










3486 



<210> SEQ ID NO 4 

<211> LENGTH: 873 

<212> TYPE: PBT 

<213> ORGANISM: H. sapienB 

<4 00> SEQUENCE: 4 

Met Asp Val Gly Glu Glu Pro Leu Glu Lys Ala Ala Arg Ala Arg Thr 
15 10 15 

Ala Lys Asp Pro Aan Thr Tyr Lya Val Leu Ser Leu Val Leu Ser Val 



Cys Val Leu Thr Thr lie Leu Gly Cys He Phe Gly Leu Lys Pro Ser 
35 40 45 

Cys Ala Lys Glu Val Lys Ser Cys LyB Gly Arg CyB Phe Glu Arg Thr 



Phe Gly Asn Cys Arg Cys Asp Ala Ala Cys Val Glu Leu Gly Asn Cys 
65 70 75 ' 80 

Cys Leu Asp Tyr Gin Glu Thr Cys He Glu Pro Glu His He Trp Thr 
85 90 95 

Cys Asn Lys Phe Arg Cys Gly Glu Lys Arg Leu Thr Arg Ser Leu Cys 
100 105 110 

Ala Cys Ser Asp Asp Cys Lys Asp Gin Gly Asp Cys Cys lie Asn Tyr 
115 120 125 

Ser Ser Val Cys Gin Gly Glu Lys Ser Trp Val Glu Glu Pro Cys Glu 

130 135 140 

Ser He Asn Glu Pro Gin Cys Pro Ala Gly Phe Glu Thr Pro Pro Thr 
145 ISO 155 160 

Leu Leu Phe Ser Leu Asp Gly Phe Arg Ala Glu Tyr Leu His Thr Trp 
165 170 175 

Gly Gly Leu Leu Pro Val He Ser Lys Leu Lys Lys Cys Gly Thr Tyr 
180 185 190 

Thr Lys Asn Met Arg Pro Val Tyr Pro Thr Lys Thr Phe Pro Asn His 
195 200 205 

Tyr Ser He Val Thr Gly Leu Tyr Pro Glu Ser His Gly He He Asp 

210 215 220 

Asn Lys Met Tyr Asp Pro Lys Met Asn Ala Ser Phe Ser Leu Lys Ser 
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225 



230 



235 



240 



Lys Glu Lye Phe Abu Pro Glu Trp Tyr Lye Gly Glu Pro lie Trp Val 
245 250 255 

Thr Ala Lye Tyr Gin Gly Leu Lys Ser Gly Thr Phe Phe Trp Pro Gly 
260 265 270 

Ser Asp Val Glu lie Asn Gly lie Phe Pro Asp He Tyr Lys Met Tyr 
275 280 285 

Asn Gly Ser Val Pro Phe Glu Glu Arg He Leu Ala Val Leu Gin Trp 
290 295 300 

Leu Gin Leu Pro Lys Asp Glu Arg Pro His Phe Tyr Thr Leu Tyr Leu 
305 310 315 320 

Glu Glu Pro Asp Ser Ser Gly His Ser Tyr Gly Pro Val Ser Ser Glu 
325 330 335 

Val He Lye Ala Leu Gin Arg Val Aep Gly Met Val Gly Met Leu Met 
340 345 350 

Asp Gly Leu Lys Glu Leu Asn Leu His Arg Cys Leu Asn Leu He Leu 
355 360 365 

He Ser Asp His Gly Met Glu Gin Gly Ser Cys Lys Lye Tyr He Tyr 
370 375 380 

Leu Asn Lye Tyr Leu Gly Aep Val Lys Asn He Lys Val He Tyr Gly 
385 390 395 400 

Pro Ala Ala Arg Leu Arg Pro Ser Asp Val Pro Asp Lys Tyr Tyr Ser 
405 410 415 

Phe Aen Tyr Glu Gly He Ala Arg Aen Leu Ser Cys Arg Glu Pro Asn 
420 425 430 

Gin His Phe Lys Pro Tyr Leu Lys Hie Phe Leu Pro Lys Arg Leu His 
435 440 445 

Phe Ala Lys Ser Asp Arg He Glu Pro Leu Thr Phe Tyr Leu Asp Pro 
450 455 460 

Gin Trp Gin Leu Ala Leu Asn Pro Ser Glu Arg Lye Tyr Cys Gly Ser 
465 470 475 460 

Gly Phe His Gly Ser Aep Aen Val Phe Ser Aen Met Gin Ala Leu Phe 
485 490 495 

Val Gly Tyr Gly Pro Gly Phe Lys His Gly He Glu Ala Asp Thr Phe 
500 505 510 

Glu Asn He Glu Val Tyr Aen Leu Met Cys Asp Leu Leu Asn Leu Thr 

515 520 525 

Pro Ala Pro Asn Asn Gly Thr Hie Gly Ser Leu Asn His Leu Leu Lye 
530 535 540 

Aen Pro Val Tyr Thr Pro Lys His Pro Lys Glu Val His Pro Leu Val 
545 550 555 560 

Gin Cys Pro Phe Thr Arg ABn Pro Arg Asp Asn Leu Gly Cye Ser Cye 
565 570 575 

Asn Pro Ser He Leu Pro He Glu Aep Phe Gin Thr Gin Phe Aen Leu 
580 585 590 

Thr Val Ala Glu Glu Lys He He Lys His Glu Thr Leu Pro Tyr Gly 
595 600 605 

Arg Pro Arg Val Leu Gin Lys Glu Asn Thr He Cys Leu Leu Ser Gin 
610 615 620 

Hie Gin Phe Met Ser Gly Tyr Ser Gin Aep He Leu Met Pro Leu Trp 
625 630 635 640 

Thr ser Tyr Thr Val Aep Arg Aen Aep Ser Phe Ser Thr Glu Asp Phe 



645 



650 



655 
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Ser Aen Cye Leu Tyr Gin Asp Phe Arg He Pro Leu Ser Pro Val Hie 
£60 665 670 

Lys Cys Ser Phe Tyr Lye Aen Asn Thr Lys Val Ser Tyr Gly Phe Leu 
675 680 685 

Ser Pro Pro Gin Leu Aen Lye Asn Ser Ser Gly He Tyr Ser Glu Ala 
690 695 700 

Leu Leu Thr Thr Asn He Vol Pro Met Tyr Gin Ser Phe Gin Val He 
705 710 715 720 

Trp Arg Tyr Phe His Asp Thr Leu Leu Arg Lys Tyr Ala Glu Glu Arg 
725 730 735 

Asn Gly Val Asn Val Val Ser Gly Pro Val Phe Asp Phe Aep Tyr Asp 
740 745 750 

Gly Arg Cye Asp Ser Leu Glu Asn Leu Arg Gin Lye Arg Arg Val He 
755 760 765 

Arg Asn Gin Glu He Leu He Pro Thr His Phe Phe He Val Leu Thr 
770 775 780 

Ser Cys Lys Asp Thr Ser Gin Thr Pro Leu His Cys Glu Asn Leu Asp 
785 790 795 800 

Thr Leu Ala Phe He Leu Pro His Arg Thr Asp Asn Ser Glu Ser Cys 
805 810 815 

Val Hie Gly Lys His Aep Ser Ser Trp Val Glu Glu Leu Leu Met Leu 
820 825 830 

His Arg Ala Arg He Thr Asp Val Glu Hie He Thr Gly Leu Ser Phe 
835 840 845 

Tyr Gin Gin Arg Lys Glu Pro Val Ser Asp He Leu Lys Leu Lys Thr 
850 855 860 

His Leu Pro Thr Phe Ser Gin Glu Asp 
865 870 



<210> SEQ ID NO 5 

<211> LENGTH: 646 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: promoter 

<222> LOCATION: (0)...<0) 

<223> OTHER INFORMATION: PC-1 promoter sequence 
<220> FEATURE: 

<221> NAME/KEY: misc_f eature 

<222> LOCATION: (1)...(646) 

<223> OTHER INFORMATION: n - A,T,C or G 






<400> SEQUI 


INCE: 5 












aaaccaacgt 


agngacgtgg 


gaatcgaaat 


atccttaggt 


gtgttcagta 


tatgtgaacc 


60 


cacgtatttt 


aagtggacga 


tttctctctc 


agagtaccgt 


aggtagtggg 


ggacggggcg 


120 


cagaggggga 


gaaacagaaa 


gtcgccttcc 


tccatggttc 


atttgcattt 


ccatccagaa 


180 


actcacaggt 


cgaccccaag 


actccactct 


ctcccgcctt 


tgagaagccg 


gaccggcatc 


240 


ggcggctgca 


tccttctcct 


cctccccgct 


ctattttggg 


gccccatgat 


ctcatgcctt 


300 


ctgcagacca 


cacgctgcaa 


ttccagccca 


gcccgcgccg 


cgaggccacg 


cagggcgatt 


360 


cctgcaagtg 


tcgggagggt 


ggccggggcg 


cggggagggg 


acggcttggg 


gggaagttta 


420 


agacacgccc 


acgtaaggga 


cccaaaataa 


ccgacacaca 


gagtgcccga 


aatcagacag 


480 


gaagccaaat 


aatccggggc 


gttgagtcgc 


tttgccctga 


ctgcgagagc 


cgggtgtagg 


540 


gcggggagcc 


aaggatctga 


ccgcgagggg 


cgggcgcggc 


ggggaggggc 


ggggcggggc 


600 
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gggcggcgcg gggcctatta aaggcgcggc ggggcagcgg ggccgg 



646 



<210> SEQ ID NO 6 

<2U> LENGTH: 350 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: 3 1 UTR 

<222> LOCATION: (0)...(0) 

<223> OTHER INFORMATION: allele "A" 

<400> SEQUENCE: 6 

agccaagaag actgatatgt tttttatccc caaacaccat gaatcttttt gagagaacct 60 

tatattttat atagtcctct agctacacta ttgcattgtt cagaaactgt cgaccagagt 120 

tagaacggag ccctcggtga tgcggacatc tcagggaaac ttgcgtactc agcacagcog 180 

tggagagtgt tcctgttgaa tcttgcacat atttgaatgt gtaagcattg tatacattga 240 

tcaagttcgg gggaataaag acagaccaca cctaaaactg cctttctgct tctcttaaag 300 

gagaagtagc tgtgaacatt gtctggatac cagatatttg aatctttctt 350 



<210> SEQ ID NO 7 

<211> LENGTH: 350 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: 3'UTR 

<222> LOCATION: (0)...(0) 

<223> OTHER INFORMATION : allele "P" 

<400> SEQUENCE: 7 

agccaagaag actgatatgt tttttatccc caaacaccat gaatcttttt gagagaacct 60 

tatattttat atagtcctct agctacacta ttgcattgtt cagaaactgt cgaccagagt 120 

tagaacagag ccctccgtga tgcggacatc tcagggaaac ttgcgtactc agcacagtag 180 

tggagagtgt tcctgttgaa tcttgcacat atttgaatgt gtaagcattg tatacattga 24 0 

tcaagttcgg gggaataaag acagaccaca cctaaaactg cctttctgct tctcttaaag 300 

gagaagtagc tgtgaacatt gtctggatac cagatatttg aatctttctt 350 



<210> SEQ ID NO 8 

<211> LENGTH: 350 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: 3'UTR 

<222> LOCATION: <0)...(0) 

<223> OTHER INFORMATION: allele "N" 

<400> SEQUENCE: 8 

agccaagaag actgatatgt tttttatccc caaacaccat gaatcttttt gagagaacct 60 

tatattttat atagtcctct agctacacta ttgcattgtt cagaaactgt cgaccagagt 120 

tagaacagag ccctcggtga tgcggacatc tcagggaaac ttgcgtactc agcacagtag 180 

tggagagtgt tcctgttgaa tcttgcacat atttgaatgt gtaagcattg tatacattga 24 0 

tcaagttcgg gggaataaag acagaccaca cctaaaactg cctttctgct tctcttaaag 300 



gagaagtagc tgtgaacatt 



gtctggatac cagatatttg aatctttctt 



350 



<210> SEQ ID NO 9 
<211> LENGTH: 23 
<212> TYPE: DNA 



<213> ORGANISM: H. sapiens 
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<400> SEQUENCE: 9 
ctgtgttcac tttggacatg ttg 

<210> SEQ ID NO 10 

<211> LENGTH: 22 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<4 00> SEQUENCE: 10 

gacgttggaa gataccaggt tg 

<210> SEQ ID NO 11 

<211> LENGTH: 109 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: mi sc_ feature 

<222> LOCATION: (1)...(109) 

<223> OTHER INFORMATION: n - A,T,C or G 

<4 00> SEQUENCE: 11 

ctctcgctgg taggtccgcg gccaggcccc ggcgcccggg agggctggga atacngggag 
ggcggcgccg agctcctgcg ctctcagcgc actcagcacc gggcacgga 



<210> SEQ ID NO 12 

<211> LENGTH: 279 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 



<400> SEQUENCE: 12 



tgagctccac cgggccggcg 


gccgctctag aactagtgga tcatgccact gtaccctagc 


60 


ctgggtaaca gagtaagaca 


ctatctctaa 


aaataaaaaa taagataaaa tattttttaa 


120 


aaaagaaacc atgtaatttt 


ctcttttctc 


cctacaggta ttgagaaggt aattaggtgt 


180 


gtgtgtgtgt gtgtgtgtgt 


gtgtgtgtgt 


gtatgtgtgt gcacagcctt attaagaatg 


240 


tgattgaggt aaacattatc 


tcctattccc 


aaggggtac 


279 



<210> SEQ ID NO 13 

<211> LENGTH: 243 

<212> TYPE : DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 13 



agatttttgc cttactttat taccccatct gtattttcta aagtagtatt tgaacctagt 60 

gtacacctaa cttagttgta ttcgttgatg tttactttga attatataat gattagaaac 120 

atctgactta tcgttcaatt ttttcagtta accaggtaag gatgagcagg gaaaaaagtg 180 

gagttatggt cattaggaaa agatccacta gttctagagc ggccgccacg cccggtggag 240 

ctt 24 3 



<210> SEQ ID NO 14 

<211> LENGTH: 231 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<220> FEATURE: 

<221> NAME/KEY: misc_£eature 

<222> LOCATION: <1)...(231) 

<223> OTHER INFORMATION: n - A,T,C or G 



<400> SEQUENCE: 14 
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cgcggcggcc gttctagaac tagtggatca tactcacgaa gacagcaatt ctgtgttcac 60 

tttggacatg ttgaatttga gacataaaac acattttgct gatgtttgtt tctagaacat 120 

agtcaaggtc aggtgctcgt tgggctctgc agcaacctgg tatcttccaa cctcttaacg 180 

gggctntaca taagtgttat cttttatatt aagantcatg gctattgggc c 231 



<210> SEQ ID NO 15 

<211> LENGTH t 313 

<212> TYPE s DNA 

<213> ORGANISM: M. sapiens 

<400> SEQUENCE: 15 



aatctgttca catactttgt ttgtggaatc tgtcttaatg tgtctcacaa gcatcacaat 60 

tattattact gttaagtgtg ttcattttat tttcttgaaa atattttagg tgagaagcag 120 

ggtaagatta tattctgagg tattaatttt ttctttttta gaagtacagc atcatttttt 180 

tctttccaaa ttaagatgat aaaaataata aaatcactgg tttattaaac attacaggtt 24 0 

gagtatcctt tatccaaaat gtttggtatg agaactgttt tggattttgg acttttttgg 300 

attttgcaat att 313 



<210> SEQ ID NO 16 

<211> LENGTH: 313 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 16 



ccgcagcccg ggggatcaca cagaccttag tggaaaatct tcactggacc tgtgccaaga 60 

agggggtaca tcttcattgg atatgtcttg tctttgcttc tttaaacatt tttttttctt 120 

tttcattacc caggtttgaa actaagtgag taacttcaga gtttactgct ggaatatcac 180 

catttcagtg agattgacta ggcaggcagt ctttcttgga aaagtactgg cagaacctaa 240 

ctgtttcact aaacttttct aatgggcaaa gtagttgaac cttgtgtagg gcgccttatc 300 

tttaataatg tga 313 



<210> SEQ ID NO 17 

<211> LENGTH: 382 

<212> TYPE : DNA 

<213> ORGANISM; H. sapiens 

<400> SEQUENCE: 17 



taagagaaaa atgaagtcat ctttaagatt ggatttgtat ccacagtgtt gctttataat 60 

tcatcctgaa tttttatctg attaaaatcc ctcctgggta atttttttta cgtgatttag 120 

actgctgtgg taccactgct aaatgaggta agccaattgt cagatgtatt taataacaat 180 

gtttattttt ttcccttcta gaaaaatgtt caccgtaagc tctgcatttc aacttctatc 240 

tgtttgaaga agtgagatgg gattgtaaca ttttttgagg gaatagattt aagataaaag 300 

aaaaacaact tattttccaa taggtagtta agtaaggaaa cccaggttct gatctttgct 360 

ctgccacaaa ctagctgtgg ct 382 



<210> SEQ ID NO 18 

<211> LENGTH: 312 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 18 



actacataaa atcttaagag gttgcgtttt gccattacct gatttttttg tttttctttc 



60 



62 



cttaaactta ttataattcc atgtagcttc 


agttatcggt 


ttctttttga 


tgattttttt 


120 


ctgtgaatgt atttaacatt aagtaaacac 


aacttgcata 


taatctgttt 


tattrtttttt 


180 


agggattaac cagtgagttc tttgtttttc 


tactaaaata 


gttaattatt 


ctcatctatt 


240 


tcaatcagag taaaataacc egattctcta 


gagcttttaa 


taactgattt 


catttagtgt 


300 


gtctgtggcc at 








312 


<210> SEQ ID NO 19 

<211> LENGTH: 3B2 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 










<400> SEQUENCE: 19 










taatc-tctga ctatttaata tgttgttgct 


gcttaagagt 


catattacat 


gattattgtc 


60 


gtctaagtgc tgoagcttgt tgaccttaaa 


agcattctag 


cactagagag 


gaatgcattg 


120 


gtgtggtatg aaaacatact ttcctaagag 


atgaatgttg 


catgatttct 


taattttcct 


180 


tcattttctg ctccagattt ggaatgggta 


tgtgaaatga 


attttttcta 


ggatctgtaa 


240 


tatagaacag cttattctta tgtaatctcc 


tttttattga 


atcctgagct 


ttagcatttg 


300 


agtgatatgt tggctgaaaa atgagaactg 


aagaactctt 


tctcaaagag 


tttagataga 


360 


tggtaaatgg acagtaaaac ta 








382 


<210> SEQ ID NO 20 

<211> LENGTH : 307 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 










<400> SEQUENCE: 20 










gggaaaataa agttttcaaa taaaaccctt 


gatttcaaac 


acaatagatg 


cgaaatagca 


60 


tttactagct cttaatgaca ttttcaatga 


aaaaaactat 


attttacacc 


caaacaattg 


120 


tcagccatct tttatttttg tttgttcttc 


attttagttc 


agtaagatga 


aaggtctgta 


180 


ggcaattaat ttctattgta aatacttcgt 


tttgtagaaa 


tgatatacta 


ttttccccta 


240 


gactacaaca aaactttgct atttgctatg 


atgttttata 


tcgaaataaa 


ttctttagta 


300 


aatgatc 








307 


<210> SEQ ID NO 21 

<211> LENGTH: 329 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 










<400> SEQUENCE: 21 










gaatttcaaa gctgtaaatt atttctcagt 


agaactgtta 


caccagtgtt 


ataaaattta 


60 


atccctatca attgaggaat tattttttcc 


attctgtttt 


tcaatgtgtt 


cgtaaaatat 


120 


tacattttga tactgtttga tttagaccac 


acagtgagta 


agtacatttt 


tctcagtaat 


180 


tatttcatta aacccagtca tcaaactgaa 


cctcgctttg 


aaggaggctg 


ctagaccatt 


240 


ttataagatt ctatcatttc tggaaaaagc 


aagtattata 


cacaatatta 


ctaaatataa 


300 



ggatgcactt taaacaaaat aagagttgg 329 



<210> SEQ ID NO 22 

<211> LENGTH: 381 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 
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<400> SEQUENCE: 22 



gtcttagttt aatgtgaatc agctcattgt 



agttgcatcc 



actggcccaa 



atctatcaat 



60 



ctgtcggtct ttctttcttt ctttgtttct ttcttttttt ttttttttaa cagagatagc 120 

tttatgtata aatagccatt agtgtggaag gtatcacatg aggttgtgct tcccattctt 180 

aggtcatcat catggtaatc tgaatttgca ttatttactc ttcaggataa agggctgaag 240 

aaagtttact tgatggtttc ccaatttttt ctgaatgttg tagttaattc ttttttaaaa 300 

atgtagtttc ttatggacag tctttaggaa aaaaatacat taaatataaa atataagtga 360 



<210> SEQ ID NO 2 3 

<211> LENGTH: 382 

<212> TYPE : DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 23 

gattttgaaa aaagtgaagt gataggtaca gctgaaattc tgtcttacct atcagatctt 60 

caactaatat gagtgctaca cccatgttta acgaatttaa ccttggaagt gaaagaagtt 120 

ctgctctgca tattaaattt tttgttaaag ttacagcatg ttttgggatt ttttttttct 180 

cctaggcatg gtactattca tgtaagtata tctctgtgat aactttgaat atggtcatat 240 

taagaatacc ttcctttagg ccgggcacag tggctcatgc ctgtaatcgc agcactttgg 300 

gaggccaaag tgggtggtca cctgaggtca ggagttcgag accagcctgg ccaacatggt 360 

gaaaccctaa aaatacatat ac 382 

<210> SEQ ID NO 24 

<211> LENGTH: 361 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 24 

gatccaaact ctgcatttaa ataccaaggc aggttttaaa gagttcattt aagtcattac 60 

attgtagcca ctgaaaggaa ttagacagac ctttagggat ctgacattct atatttttgt 120 

attatgtttt aattatagta tacaatcaac tattaattct tatgtttgtt cccctccagt 180 

taactatgaa ggcattgccc gaaatctttc tgtgagtato tttattttcc attatctagt 240 

tatttttact tttgtataat atatattgag agaaaagttt cagcatctat tattgggatt 300 

gaaggattag aatattttag taatctgggc caacatggaa atgctgtgta gtttaaagat 360 

c 361 

<210> SEQ ID NO 25 

<211> LENGTH: 384 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<4 00> SEQUENCE: 25 

ctgatgaaat gtttgtgaaa aaaaatttca tatgaagtta gaaagcaatt tcaagaaaag 60 

ttgacacttt ttatagatat tagggaaata tctttcccta ataaatatct ttccctaaaa 120 

aagttgacac ttttttagat attagggaaa taatagtttt tctttgctgt ttgcaatttc 180 

agtgccgggg cattgtaagt tctgacagtc tcccaggtaa acttagtctg atcggttagt 240 

gattcagggt aaccattggg ccctttctaa caatattgtt atgtgaaaac tgtataagta 300 

tgattetctt cactctaacc caggatttct aatgtcggcc tatggatgtt tgagttagat 360 



icagaaac c 



381 
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aattctttgt tgtggagagc tgtc 



384 



<210> SEQ ID NO 26 

<211> LENGTH: 381 

<212> TYPE : DNA 

<213> ORGANISM: H. sapiens 

<4 0 0> SEQUENCE: 26 

aaaagataga ggtgocttct taatgctttt caaagccagg tggttttatt taccgttgtg 60 

ttggtttaac aaaatagtta catacttttt aatcaatgaa aataatgtta tgattatcaa 120 

ttatgtttta tgaaaggact ttacattttt aattcatata tgtcaacatt aggaatgcaa 180 

gtgagtaaac ctattatact taattggatt aaatctaaag aaaaaatgat atgcaaagtt 240 

ttagacttga aaacatactg tgattatatg tcttgaatga gaattaatgg aacatacttt 300 

cataaagcta tttttctttg aacattaaag aattttgtts aagttttata ttcattggct 360 

attactaaaa agtcaaaaaa c 381 

<210> SEQ ID NO 27 

<211> LENGTH: 383 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 27 

aaaactaaga gacctatcct agatgtcctt agattatgtg tgtgataggg ttaaaactat 60 

atttcccaca aagtccactg agcgtggtag ttttcctctt atcttatcat aaccagtttg 120 

tatatgtaca atgtggataa cagaattttt gggaccaact tgtagacagc tgaaatgcac 180 

tgataaactt ccttttctgg ccatctaggc cctgtgtggt aagtgtgaac aggtgccttt 240 

tttcccttct gaaaatagac ctgaaatagg attatcaaaa gcaggtcaca ttgtaggcaa 300 

ctttgtggag atgatggtga ggcaagacag atttttacct tcttcctgac tctcagactc 360 

actgaagaaa tgtggggaac atg 383 

<210> SEQ ID NO 28 

<211> LENGTH : 384 

<212> TYPE : DNA 

<213> ORGANISM: H. sapiens 

<4 00> SEQUENCE: 28 

catatcagta tttctattaa aaataaccta gtcttaaata ctctaaaacc caagagagtt 60 

ttatactttt attttagtta aagagtaaat gactcatgta tttggtttta aaaaagtaaa 120 

gatcatggca caagtctact atttgtttga tttgaaacat ctaagtaact ctaccatctt 18 0 

gaaattatgc agatttactt cggtaagtat cgtcaagaag tttggtccag tatgtatggt 24 0 

ttgatagcac cctctgcata gcatgtgetg taaaaatact taataatcaa attagaattt 300 

aggagtgggg gtaggtaaac atatgtttta attctagggg gcgeatgtaa atcttttgtg 360 

atatatcttt tctctttcta gttt 384 

<210> SEQ ID NO 29 

<2U> LENGTH: 339 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 29 

gtgaaagagc aacactcttg ccttgaaaga gaaaaaaaaa tccactaata caagactatc 60 
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ataaatgatc tttgttctat gttggaataa tcaatctata gcggtctatg ttacaaaatt 120 

taaaacatgt ctctcagtcc ttacaaatag ttttataacc ttttttcaga ttttgccgaa 180 

ggtaaggcat gctacacact caagctcgga atgtgaagca ggcattttct catcagtgtg 240 

aaatgcagag aactggcttg ggggtattat ttgagaataa ccaataaaat aaagggagtt 300 

ctggaggacc acctgatgaa acatagaggt ttctttgct 339 

<210> SEQ ID HO 30 

<211> LENGTH: 327 

<212> TYPE : OTA 

<213> ORGANISM: H. sapiens 

<4 00> SEQUENCE: 30 

gtcttcttaa ttgtttatgc ttgtaccctt tgtaatcagt ttttttaata gttaaaagta 60 

aatcttcaat ataattaagt agaggaaagg attagatgag tgtatcacac tatatattat 120 

catataatgc acactaacta catttatttt catcctgtga cccaagagaa gattagacag 180 

aaatgcaagt atttgtcacc tctttatgtg tggccatttc aaattaatga ttaagcagaa 240 

cattaaatgc atagtttctc actgttcacc ttggctttat actcagttcc cgcattagag 300 

gaacactgaa gagggagtca gaaaaat 327 

<210> SEQ ID NO 31 

<211> LENGTH: 427 

<212> TYPE i DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE : 31 

tttaatattg taaagcattt ttacacttta gttagaaaaa aagatgaata tactagtagg 60 

aaaataggga aggacatgag ctgacagcta gagcttcata attttatgat gtagttcacc 120 

tttaaatatt aataaagcaa ttttcttctc tgtgcctgat atctgagagt tcttctcatt 180 

ttcgttcttc aggacaccac cacgtaagtt ttttcctctc ctgaccttcc cttttctcct 240 

ttttgttttc tttcttgttt ataaatccta ccatacatta tagggtaata tatatattac 300 

ctattatata tatataaaat attacctatt ttatatatat attatatata taatatatat 360 

aaagtatata tattactatt ttatatatat atagctatat atatatacct ttgtttattt 420 

attgtga 42 7 

<210> SEQ ID NO 32 

<211> LENGTH: 380 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 32 

ctcatcttga aaagacttct taaatatttt atttttgtaa aggacttgac caaacacata 60 

acattttccc tcgaccctgt acttgggaaa gttttacagg tttaagatgg tactcagcta 120 

atttttaaaa atgctcccct aaccatgaga aagtataatt tcctatgtta tttgtgaaga 180 

atgaaaaagt tgtcctcttt tctctttgta gaactattca aggtaaataa tgttaactct 24 0 

atatttgata attttaatga atttgtgcac atataggcat aattcatatg tataggactt 300 

atggtctaaa ttaaatgaat taataccaaa tacattctta aaggtttaac tttgagaata 360 

ctagtacaca aaaattctac 380 



<210> SEQ ID NO 33 
<211> LENGTH: 384 
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<212> TYPE: UNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 33 

ctgggtgata tagcacgact ctgtctctaa acaaaaaaca aaacaaaacg aagactgaag 60 

ccaaacttga ctttatcttt atttactata aatgctaatt ttgaatcatg gtgttaattt 120 

atttcacacg tcaacatggt cccttgttct tttgaaacta cactggcttc tatcttgttt 180 

cagttataga ggcagtaaga acatatttca ttactcttaa aaataggaat taccatccag 240 

tagaaatggg attaccatcc agttgagtca acagaacctt ttttatccag tgtcgtatgt 300 

ttatgtgtat gacacttctg actacacagg aagcctcttg aaatatctga ttaattttga 360 

tgttttgctc aatgttcagt aaaa 38 4 

<210> SEQ ID NO 3 4 

<211> LENGTH : 328 

<212> TYPE J DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE: 34 

gttcttatat ttaattattg gttggaattt gatttttata tgtattaaaa gcatgctcta 60 

ctgaaatatt catcaaaagg aagatagtta tttctttctt aaaatgaata ttggcatgtt 120 

ttacagaaaa atgtgtggta agtagctttt gtatatttac tttgcatgtt gaaaatctag 180 

acatatgcat atttgtttat gtcacccatc tgacattaca gtgagagaaa gcacaactga 240 

gtacacatgg acttcgaaat tataggatgc ttttaaattt gatcttttaa gatgacatat 300 

ctttggggaa gactaccctg tctgcttt 328 

<210> SEQ ID NO 35 

<211> LENGTH: 384 

<212> TYPE: DNA 

<213> ORGANISM: H. sapiens 

<400> SEQUENCE : 35 

aattaaacaa acatgcatgg tatgtattag aaggaaagct actcaagagg agagatgatg 60 

cctaacaaat catgtggcac gttccacttc agagctgaaa tctcgtaaat gattaaactg 120 

gggagatgga gcacttatag aagtgaactg agtgttctct tggtaacttt tcttttatat 180 

ttcctattct cctagcatgg atttaaaaaa gaaaaatatt cctatcctgc tcactggtaa 24 0 

ttaacatagg tttaaaatgg cttcaaatgt ggccctatag acggttaaaa ttgtacctta 300 

tcttggcaaa acttcagagc accagtcagt gcatgcaagg tgccattttt tattgagatg 360 

cttagaatgt ttctttctgt gcac 38 4 



What is claimed is: 

1. An isolated nucleic acid encoding the amino acid 55 
sequence of SEQ ID NO: 4. 

2. An isolated oligonucleotide consisting of at least 18 
contiguous nucleotides from an isolated nucleic acid encod- 
ing SEQ ID NO: 4, wherein said oligonucleotide includes a 
portion of the nucleic acid sequence that encodes a 60 
glutamine at position 121 of SEQ NO: 4. 

3. A method for detecting a predisposition to insulin 
resistance in an individual, the method comprising: 



analyzing an individual for the presence of a genetic 
polymorphism in the genomic sequence of a human 
PC-1 allele, wherein said human PC-1 allele encodes a 
glutamine amino acid at position 121 of SEQ ID NO:4, 
and wherein the presence of said glutamine at position 
121 of SEQ ID NO: 4 is indicative of a predisposition 
to insulin resistance. 

***** 



